Seattle, WA
December 10–13, 2018

Case Studies
Tuesday, December 11


Cloud Jumping with Kubernetes - James Bowes, Manifold
Kubernetes promises a degree of freedom from cloud vendor lock-in, providing an API and runtime for containers that most cloud providers support. Beyond the core of container orchestration, there are abstractions for load balancers, persistent storage and autoscaling, to name a few. With this in mind, when Manifold set out to switch cloud providers, we decided to do so by first using as many abstractions in the Kubernetes ecosystem as possible.

In this talk, we will explore how Manifold used Kubernetes to migrate between cloud providers. We will look at how to identify cloud specific services and APIs in use in your application, and how Manifold evaluated whether to uplift to a Kubernetes abstraction, translate to the other cloud provider’s service, or keep as is. Finally, we will discuss the challenges and successes of the actual migration between cloud providers.


James Bowes

Technical Lead, Manifold
James Bowes is the Technical Lead at Manifold. Over his 13-year career he has worked for companies like Red Hat and Salesforce as a senior member of the technical staff. James has scaled early stage startups and also managed and sustained large organizations. James is mainly a backend...

Tuesday December 11, 2018 10:50am - 11:25am
4C 1/2


Container Platforms as Equalizers: Running Health Services Across the World - Jamie Hewland, Praekelt.org
Praekelt.org creates and operates a number of health and youth-related services which are hosted on containerised clusters around the world, often in countries without an established cloud provider presence. This means that the infrastructure reliability and tooling that may typically be available are not. In addition, as a small team managing clusters in several isolated datacenters around the world, achieving commonality is challenging.

While we started using container orchestration because we wanted to increase resource utilisation and deployment agility, we have found the real value has been in our ability to abstract many of the differences between clusters.

Now, as we move towards Kubernetes, we will share lessons for shifting developers between different container orchestrators as seamlessly as possible by using Spinnaker as a common continuous deployment tool.


Jamie Hewland

Site Reliability Engineer, Praekelt.org
Jamie is a Site Reliability Engineer (SRE) at Praekelt.org, a non-profit organisation that builds health and youth-related services that aim to improve the wellbeing of millions of people around the world. Jamie has contributed to several popular open-source projects, but is most...

Tuesday December 11, 2018 11:40am - 12:15pm
4C 1/2


Cloud Native Transformation - Pavel Nikolov & Matt Hope, Fairfax Media
This is a story about a 185-year-old publishing company which goes through cloud native transformation. In less than a year the team manages to switch from releasing their monolith monthly to releasing their microservices many times a day. Pavel will describe the process of adopting more than half of the CNCF incubating projects, creating hundreds of Kubernetes namespaces in multiple clusters, while ensuring excellent automation and observability practices. And with all this, the vast majority of developers in the team do not need to use kubectl, helm or anything other than git and chat bots. Damian will talk about what this transformation means for the business and how it affects the company culture.


Matt Hope

Principal Systems Engineer, Fairfax Media
Matt is a Principal Systems Engineer at Fairfax Media, the company behind some of Australia’s oldest and most widely read news publications, such as the Australian Financial Review, Sydney Morning Herald and The Age. Matt has been working with IT Infrastructure for the past twenty...

Pavel Nikolov

Principal Software Engineer, Fairfax Media
Pavel is a Principal Software Engineer at Fairfax Media. During his 13 years of professional experience he has worked on front end projects, backend services, data streaming solutions, as well as distributed systems with many thousands of requests per second. Pavel is interested...

Tuesday December 11, 2018 1:45pm - 2:20pm
4C 1/2


The Evolution of the AppDirect Kubernetes Network Infrastructure - Pierre Lacerte & Alex Gervais, AppDirect
In this talk, we’ll discuss the joys and pain that drove the evolution of the AppDirect ingress architecture, from NodePorts, to HAProxy/Consul, and ultimately to the open source Ambassador API Gateway. At AppDirect, we run 12 Kubernetes clusters, handling thousands of requests/minute, and we rely on our gateway to provide key cross-cutting concerns like authentication, rate limiting and observability. Join us to learn about how both development and ops teams use Ambassador, and how this acts as a control plane to the underlying Envoy data plane, which, for example, allows us to specify routing and rate limiting via Kubernetes annotations. We’ll also talk about our ingress roadmap and the planned future for the API Gateway as AppDirect scales.


Alex Gervais

Staff Software Developer, AppDirect
Outdoorsy, data-driven, eternal student, not so geeky creative entrepreneur and traveler. Alexandre is a curious, introverted and humble character. Working by day as a Staff Software Developer at AppDirect he has many years of savoir-faire building full-stack systems from cloud infrastructures...

Pierre Lacerte

Director Engineering, AppDirect
Pierre is a Director of Software Development at AppDirect and currently leads the Platform and Data & Analytics teams. After receiving a B.Ing Computer Engineering from Université de Sherbrooke, he worked as a full stack engineer at Morgan Stanley for 5 years. In 2014, he joined...

Tuesday December 11, 2018 2:35pm - 3:10pm
4C 1/2


Kubernetes at Reddit: An Origin Story - Greg Taylor, Reddit, Inc
Despite being ranked in the top 20 busiest sites in the world, Reddit has been a small engineering organization for most of its 13 years. However, the last few years have seen us multiply in size by 7x, causing our tooling, process, and culture to creak and groan. Discussion, evaluation, and experimentation led to our decision to incorporate Kubernetes.

This session takes the audience through the motivations, the story, and the results of rebasing our infrastructure on Kubernetes. Today, Kubernetes forms the core of our internal Infrastructure "product", which is a paint-by-numbers experience that engineers of all backgrounds make use of.

The audience will see how and why a small (but rapidly growing) organization operating at a unique scale shifted to Kubernetes, get practical tips and advice, and _maybe_ some dank memes.


Greg Taylor

Engineering Manager, Reddit, Inc
Greg Taylor leads the Release Engineering team within Reddit's Infrastructure division. He and his team steward the internal Kubernetes-based infrastructure product (InfraRed) and build tooling and process to empower service owners to get their ideas to production. Greg has recently...

Tuesday December 11, 2018 3:40pm - 4:15pm
4C 1/2


CI/CD in Light Speed with K8s and Argo cd - Billy Yuen & Parin Shah, Intuit
Enterprises have benefited greatly from the elastic scalability and multi-region availability by moving to AWS, but the fundamental deployment model remains the same.

At Intuit, we have adopted k8s as our new SaaS platform and re-invented our CI/CD pipeline to take full advantage of k8s. In this presentation, we will discuss our journey from Spinnaker to Argo CD.

1. Reduce CI/CD time from 60 minutes to 10 minutes.
2. Reduce production release (or rollback) from 10 minutes to 2 minutes.
3. Enable concurrent deployment using Spinnaker and Argo CD as HA/DR to safely adopt the new platform with no downtime.
4. Be compatible with the existing application monitoring toolset.


Parin Shah

Software Engineer, Intuit, Inc.
Parin is a software engineer on Intuit's Payments platform team. He was one of the early adopters of Kubernetes at Intuit and helps other teams accelerate their move to AWS/Kubernetes. He focuses on using modern CI/CD principles and other cloud services to increase developer velocity...

Billy Yuen

Principal Engineer, Intuit
Billy Yuen is a principal engineer on Intuit’s Platform team focusing on modern SaaS adoption (AWS and Kubernetes), system resiliency, and monitoring. Previously, Billy worked on Netflix’s Edge Services team to build the next generation of edge-service infrastructure to...

Tuesday December 11, 2018 4:30pm - 5:05pm
4C 1/2
Wednesday, December 12


Abstracting Heavy Lifting: How K8s Speeds Up Multi-Cloud Solution Delivery - Rob Zuber, CircleCI
At CircleCI, we build a complex, distributed system for CI/CD used by thousands of sophisticated teams. While we started as a single-cloud, SaaS product, we’ve grown to serve huge customers, across clouds and behind customers’ firewalls. With scale came issues in our ability to build, distribute, and manage our product.

CircleCI CTO Rob Zuber will discuss how we hit the limits of distribution via an unsophisticated orchestration engine, and how consolidating on K8s helps us control networking, security, and availability while minimizing one-off work. K8s also allows us to live a value: only building what is a core differentiator to our business.

Even if your team isn’t selling a distributed system, you can apply what we’ve learned. Abstracting your heavy lifting makes it possible to escape the whims of AWS, GCP, or Azure, reduce maintenance, and ultimately, helps drive velocity.


Rob Zuber

CTO, CircleCI
Rob Zuber is a 20-year veteran of software startups; a four-time founder, three-time CTO with past investments from Google Ventures, Relay Ventures and Foundation Capital as well as exits to Appconomy and CircleCI. Since joining CircleCI, Rob has seen the company through its Series...

Wednesday December 12, 2018 10:50am - 11:25am


Our Journey to Service 5 Million Messaging Connections on Kubernetes - Dylan O'Mahony, Bose & Dave Doyle, Connected
We present a case study on how we built a high-scale messaging service and state store on Kubernetes. The solution supports millions of persistent, concurrent connections; enables tens of thousands of messages per second; is globally addressable; stores millions of states; and responds with minimal latency (<250ms).
To evaluate build approaches, the team split into Makers & Breakers. Makers developed the solution stack while Breakers focused on repurposing Locust, a high-scale load testing framework, to simulate behavior. Leveraging the flexibility of Kubernetes, we were able to scale the stack and solve blockers on the path to a viable solution. Blockers included ingress, file descriptors, service discovery and resource limits. The experience was deeply educational, generating key learnings for developers tasked with building a scaled solution on top of Kubernetes.


Dave Doyle

Software Engineer Manager, Connected
Dave Doyle leads a diverse team of engineers, exploring the viability of high-scale, low-latency messaging solutions for next-gen IoT applications using VerneMQ and Cassandra. He spent the last 18 years building high-performance systems across a range of technical domains. He has...

Dylan O'Mahony

Cloud Architecture Manager, Bose
Dylan has spent his entire career at Bose, focused on bringing new technology and software solutions to bear on the business. His journey has taken him from managing the bose.com web technology team, to running an IT innovation lab to, most recently, leading the architecture of a...

Wednesday December 12, 2018 11:40am - 12:15pm


Evolving Legacy Systems into Kubernetes at Lyft: A Hybrid Environment - Lita Cho & Jose Nino, Lyft
Before Kubernetes, Lyft had hundreds of services deployed as VMs in AWS. Moving over to Kubernetes is a big undertaking. This talk will go over how we leveraged Envoy and the go-control-plane (the service that hosts Envoy configuration and discovery API) to split traffic between our legacy infrastructure and our Kubernetes infrastructure. This talk will demonstrate how Lyft handled service discovery within our hybrid environment, various ways to control the traffic when Kubernetes issues came up, and how Lyft used Envoy as a sidecar container to keep on par with the legacy behavior. Lastly, this talk will discuss the various challenges we faced with this migration plan and why it was nonetheless needed due to the size of Lyft’s legacy infrastructure.


Lita Cho

Senior Software Engineer, Lyft
Lita is a senior software engineer on the Networking team, building out the service mesh to handle both Kubernetes and legacy systems at Lyft. She also maintains the tracing infrastructure at Lyft. Before that, she worked on building out the API infrastructure using Protocol Buffers...

Jose Nino

Software Engineer, Lyft
Jose Nino worked on Lyft’s Networking team for 2+ years building out infrastructure that enabled Lyft to scale technically and socially as it developed and rolled out an Envoy-based service-oriented architecture. He was instrumental in building control plane technologies, and resilience...

Wednesday December 12, 2018 1:45pm - 2:20pm


Automating Research Workflows at BlackRock - Matthew Magaldi & Vaibhav Page, BlackRock
BlackRock’s Data Science Platform enables investors and users across the firm to access a wealth of financial data using research models. Event-based workflows play an integral role in data driven modeling. In this presentation, the speakers will give an overview of BlackRock’s Data Science Platform and talk about the transition from vanilla Kubernetes Cron jobs to more complex sensor-driven research workflows. They will discuss how these workflows, powered by Argo Events, an open source event-based dependency manager for Kubernetes, form a core component of the platform. They will also review the challenges of building automated financial models and look forward to the class of problems this framework solves.


Matthew Magaldi

Analyst, BlackRock
Matt Magaldi is an Analyst at BlackRock working on the Aladdin Wealth platform. Despite being relatively new to the open-source community, Matt leads the Argo Events project. Matt graduated from Cornell University in 2016 with a Bachelor of Science in Electrical and Computer Engi...

Vaibhav Page

Analyst, BlackRock
Vaibhav Page is an Analyst at BlackRock working on the Data Science platform. He primarily works on building out the cloud native scheduler for event-driven workflows. Along with Matt, Vaibhav created Argo Events.

Wednesday December 12, 2018 2:35pm - 3:10pm


How Atlassian Built Our Own Kube Clusters and Why You Shouldn’t Do the Same - Nick Young, Atlassian
Three years ago, deciding between building or buying k8s infrastructure was simple. Few vendors existed, and none could support Atlassian's self-imposed requirements as we sought to replace our hand-rolled PaaS.

Fast forward to today and our internal Kubernetes clusters now run both batch and stateless service workloads. If we repeated our build vs buy decision, the outcome would no longer be clear.

Come along to hear about how we designed and built our Kubernetes clusters to run on our cloud platform and why we think you’re probably better off starting on a managed service today.


Nick Young

Principal Engineer, Atlassian
Nick has been working to prevent the entropic downfall of systems for 20 years, across Windows and Linux, datacenters and clouds, networking, storage and compute. Currently, he's a Principal Engineer in Atlassian's Kubernetes Infrastructure Technology Team, where in addition to his...

Wednesday December 12, 2018 3:40pm - 4:15pm


Towards Trading on Kubernetes: Operating Multi-Tenant and Secure Clusters - Andrew Kochut & Javier Diaz-Montes, Two Sigma
Two Sigma, a financial company, performs large-scale data processing for modeling and trading while facing risks, such as data exfiltration. We present how we addressed this by building multi-tenant Kubernetes clusters to run over 500 services on 30K cores and 200TB of RAM. These include parts of our trading system and our document translation system, build and test farms, and artifact caches.

Kubernetes doesn't provide full tenant isolation, so users often create per-tenant clusters. Two Sigma has many teams with unique data and service access needs, so such a model would have large overheads. We built multi-tenant clusters by coupling namespaces, RBAC and PSPs with Two Sigma’s entitlement system. We also integrated Kerberos via annotations to inject tickets, keytabs, and SSL certs into Pods. We discuss lessons from operating this both on-prem and in public cloud, including pros and cons of GKE.


Javier Diaz-Montes

Software Engineer, Two Sigma
Javier Diaz-Montes is currently a Software Engineer at Two Sigma, where he is a member of the Platform Orchestration team working on enabling Kubernetes firm-wide. Before joining Two Sigma, he was Assistant Research Professor at Rutgers University (2012 to 2016), where he focused on researching...

Andrzej (Andrew) Kochut

Vice President, Two Sigma
Andrzej (Andrew) Kochut serves as a Vice President and engineering manager at Two Sigma. His responsibilities include container based compute, software deployments, and load balancing. His team focuses on transforming Two Sigma to embrace the cloud-native design paradigm via adoption...

Wednesday December 12, 2018 4:30pm - 5:05pm
Thursday, December 13


How T-Mobile Built and Scaled Kubernetes On-Prem for the 2018 Retail Season - James Webb, T-Mobile
Have you ever supported a major retail craze like the immediate spike in sales during a new smartphone launch? We have! We've deployed Kubernetes on-premise to support upcoming new smartphone launches and the busiest days of the year for T-Mobile including Black Friday & Cyber Monday. Through our “fail fast, fail forward” approach, we learned many valuable lessons on the way to being successful. Tons of research and testing took place, because failure in production was not an option. Millions of customers purchase smartphones on several key dates, producing huge spikes in transactions across dozens of internal business applications. Come hear our story of how we planned a Kubernetes project for the real world, including the biggest surprises, and keys to success. We’ll help you understand all the things you may not know about deploying on-premise at scale.


James Webb

Member of Technical Staff, T-Mobile
James is a senior technical leader at T-Mobile who co-led T-Mobile's massively successful Cloud Foundry initiative for the past two years and is currently working on bringing more container goodness to the enterprise in the form of on-premise Kubernetes.

Thursday December 13, 2018 10:50am - 11:25am
Ballroom 6A


Migrating Internet Archive to Kubernetes - Tracey Jaquith & David Van Duzer, Internet Archive
The Internet Archive has been on a mission to provide universal access to all knowledge for 22 years. While it continues to preserve the history of the web, the Archive must keep its infrastructure aligned with the present and future of the web. Tracey and David will review the Kubernetes migration: the 600,000 line PHP monolith powering core library operations, as well as a wide variety of auxiliary services (like web crawling) supported by many development teams. They will cover deploying Kubernetes at progressively larger scales in the Archive's own data centers, and integrating existing developer workflows with GitLab, Auto DevOps, Helm and Prometheus.
Tracey Jaquith has architected much of Internet Archive since 1996 (starting with founder Brewster Kahle) and focuses on DevOps, TV, audio/video, and AR/VR.
David Van Duzer manages web-wide crawl operations at Internet Archive.


David Van Duzer

Software Engineer, Internet Archive
David Van Duzer has spent 20 years professionally operating Internet systems, which mostly consists of staring at blinking lights, and asking developers annoying questions. He joined the Internet Archive in 2017 to manage web-wide crawl operations.

Tracey Jaquith

TV Architect, Internet Archive
Tracey was the founding coder and system architect for Internet Archive in 1996. Recently, she rewrote Archive's TV recording system to an open source single server system, capable of 75 simultaneous 24x7 channels, made the TV site "full stack", and brought archive.org website to "version...

Thursday December 13, 2018 11:40am - 12:15pm
Ballroom 6A


Machine Learning the Kubernetes Way - Ania Musial & Keith Laban, Bloomberg
The Bloomberg Professional Service (aka "the Terminal") provides data, analytics, news, information and communication to professionals in business, finance, government, and philanthropy. Through our internal machine learning platform, Bloomberg is applying advanced AI and GPU-accelerated compute to dozens of domains such as NLP, computer vision, time-series analysis and personalization.

In this talk we'll discuss how we adapted Kubernetes to support a diverse set of machine learning workloads; how an ecosystem of data connectors enables rich collaboration; how we address multi-tenancy, data security and privacy; and finally, how we support heterogeneous compute requirements and hardware acceleration.


Keith Laban

Senior Software Engineer, Bloomberg
Keith Laban is a senior compute infrastructure engineer at Bloomberg. He works on leveraging Kubernetes as the building blocks of a machine learning platform. Keith has given internal talks at Bloomberg about Kubernetes and building custom controllers (operators). Keith has worked...

Ania Musial

Senior Software Engineer, Bloomberg
Ania Musial is a Senior Software Engineer in the ML Platform team at Bloomberg, focusing on components required for a more robust machine learning model development lifecycle. Previous projects in her 9-year tenure at Bloomberg include building recommendation system infrastructure...

Thursday December 13, 2018 1:45pm - 2:20pm
Ballroom 6A


Monolith to Microservice: Pitchforks Not Included - Jason Plum, GitLab
Learn how GitLab turned its omnibus into cloud native Helm charts by way of containerization and orchestration. This talk aims to help practitioners already running large scale, successful products make decisions on how to move to microservices while maintaining product development cadence and serving customers on legacy software every day. It’s like driving a race car and fixing it as you are competing in a race, without pit stops.

We will cover:
- How we made the application stack capable of scaling via containerization, through many changes to stateful behaviors.
- Why we made the changes from an architectural view.
- How on earth we accrued the technical debts we had to fix in the first place.
- Most importantly, we’ll demonstrate why the monolith concept was the right place to start, but Kubernetes is our future.


Jason Plum

Sr Distribution Engineer, GitLab
Jason Plum is a Senior Software Engineer, Distribution at GitLab, providing a large number of deployment methods for GitLab’s product. He has worked extensively to build GitLab’s cloud native Helm charts, enabling the ease of use that GitLab customers have come to expect, but...

Thursday December 13, 2018 2:35pm - 3:10pm
Ballroom 6A


HomeAway's Multi-Cloud Traffic Director - Mark Tyrrell & Andrew Hodges, HomeAway
A case study of how HomeAway built a global Multi Cloud Traffic Management solution using Content Distribution Networks and a hybrid of public and private clouds. Over the last two years we have grown a team dedicated to extending our capabilities related to routing traffic to thousands of applications, microservices and functions produced by our developers.

HomeAway's stack leverages CNCF projects including Linkerd, OpenTracing, and Prometheus, as well as Kafka streams, to support plug-and-play adoption of future observability components.

The talk will be supported by live demos and example code.


Andrew Hodges

Engineering Manager, HomeAway
Andrew leads HomeAway’s Edge development team, bringing his experience with building open source traffic management platforms since 2014. In conjunction with the Linkerd service mesh, the team is currently developing a global control plane with a multi-cloud backing.

Mark Tyrrell

Principal Architect, HomeAway
Bootstrapped HomeAway's Edge development team after writing a Java experiment-based traffic director and adopting a service mesh based on Consul and Linkerd in 2016. Supports HomeAway's Cloud Engineering organization as it grows to tackle challenges around Multi Cloud Platforms-as-a-Service...

Thursday December 13, 2018 3:40pm - 4:15pm
Ballroom 6A


Panel Discussion: Avoiding the Weeds in the Cloud Native Landscape – Moderated by Priyanka Sharma, GitLab
The CNCF is one of the seminal technology organizations of our time and has come to house a plethora of projects and technologies that are relevant to the ecosystem it has created. In this heyday of technical innovation, it can be hard for organizations to decide what tooling and workflows to use as they move to cloud native. There is the tension between best of breed point solutions and what you need across the organization. This panel brings together practitioners who have made infrastructure and tooling decisions for large-scale companies and businesses and offers their perspective.

The questions we will cover are:
* What are the main categories of tooling when someone moves to cloud native?
* How do you choose tools - best of breed or integrated solutions?
* Share a horror story about a tool/solution your company chose.
* Share a success story about a tool/solution your company chose.


Priyanka Sharma

Director of Technical Evangelism, GitLab
Priyanka Sharma is the Director of Cloud-Native Alliances at GitLab Inc. She also serves on the board of the Cloud Native Computing Foundation (CNCF) and has deep expertise in DevOps and observability. A former entrepreneur with a passion for growing developer products through open...


Brendan Aye

Director, Platform Architecture, T-Mobile

Melissa Chapman

Director of IT - PaaS & Config Mgmt, CVS Health

Jasmine James

IT Manager - DevOps CoE, Delta Air Lines

Matt Klein

Creator, Envoy
Matt Klein is the creator of Envoy and a software engineer at Lyft. He has been working on operating systems, virtualization, distributed systems, networking and making systems easy to operate for nearly 20 years across a variety of companies. Some highlights include leading the development...

Thursday December 13, 2018 4:30pm - 5:05pm
Ballroom 6A