KubeCon + CloudNativeCon North America 2018: Full Schedule

Seattle, WA
December 10–13, 2018

10:50am PST

gRPC Performance; Tuning Applications and Libraries - Noah Eisen, Google

The gRPC C++ team has been working on performance for over a year now. In this presentation we will share the story of our journey, with insights on tuning applications that use gRPC as well as on optimizing the library itself.

All concrete examples will be from gRPC, but the high-level concepts will be interesting to anyone who has worked on performance.

We will focus on:
- benchmarking
  + microbenchmarks
  + synthetic benchmarks
  + application benchmarks
  + cutting down noise on benchmarks
- tooling
  + flamegraphs
  + latency breakdowns
- concrete optimizations
  + tuning the threading model of gRPC apps
  + high performance network polling systems

Speakers

Noah Eisen

Software Engineer, Google

Noah Eisen, who hails from the University of Michigan, has worked at Google for the past two years. For most of that time he has been on the gRPC C++ team, and within that, the performance sub-team. He now leads the team's benchmarking and tooling efforts, focusing on measuring the impact...

gRPC Performance; Tuning Applications and Libraries pdf

Thursday December 13, 2018 10:50am - 11:25am PST
4C 3/4

Performance

Skill Level Intermediate
Link to Session Recording https://youtu.be/cWlXQP9AORo

11:40am PST

Got a Need for Speed? Accelerate Your Prometheus Dashboard Using Trickster - Shilla Saebi & James Ranson, Comcast

We live in a world where high performance and speed are essential. A few extra seconds of response time on a dashboard can be a deal breaker. Many dashboards request the entire time range of data from the time series database every time a dashboard loads or reloads. This can result in slower rendering times and in different results depending on when the request is made. We are proud to announce that Trickster, a new open source project, was developed to address this very issue.

Written in Go, Trickster is a reverse proxy cache for the Prometheus HTTP APIv1 that considerably accelerates dashboard rendering times for any series queried from Prometheus. This is possible because of the delta proxy, step boundary normalization, and fast forward features. In the presentation, we will discuss how Trickster was developed at Comcast, and we will show you a live demo of the software.
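
As a rough illustration of step boundary normalization (not Trickster's actual code, which is written in Go), the sketch below assumes the idea is to snap a query's start and end timestamps down to multiples of the step, so that requests made a few seconds apart cover the same aligned range and can share cached data. The function name and the timestamps are hypothetical.

```python
def normalize_to_step(start: int, end: int, step: int) -> tuple[int, int]:
    """Snap a query range (Unix seconds) down to multiples of `step`."""
    if step <= 0:
        raise ValueError("step must be positive")
    return start - start % step, end - end % step


# Two dashboard loads 28 seconds apart, each asking for the last hour
# at a 60-second step (hypothetical timestamps).
first = normalize_to_step(1544726403, 1544730003, 60)
second = normalize_to_step(1544726431, 1544730031, 60)

# Both requests now describe the same aligned range.
print(first == second)
```

Without this alignment, every reload would request a slightly shifted range, which is consistent with the abstract's point that results can differ depending on when the request is made.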

Speakers

James Ranson

Principal Software Architect, Comcast

James Ranson is a Platform Software Architect currently living in Denver, Colorado. He has been with Comcast for over 8 years, specializing in creating software and platforms that operate efficiently and scale horizontally. He is an expert on software development and release management...

Shilla Saebi

Program Manager, Open Source, Comcast

Shilla Saebi is an Open Source Program Manager who focuses on community and has been with Comcast for almost a decade. She has worked in many diverse roles within the tech industry in positions ranging from operations engineering, system administration, customer service, and network...

Trickster KubeCon pdf

Thursday December 13, 2018 11:40am - 12:15pm PST
4C 3/4

Performance

Skill Level Any
Link to Session Recording https://youtu.be/ZO7nrX20iT0

1:45pm PST

eBPF Powered Distributed Kubernetes Performance Analysis - Lorenzo Fontana, InfluxData

Since the Linux kernel 4.x series, many enhancements to the eBPF ecosystem have reached mainline, giving us the capability to do a lot more than just networking.
The purpose of this talk is to give an initial understanding of what eBPF programs are and how to hook them to programs running inside Kubernetes clusters. The goal is to answer targeted questions at cluster level about very specific, fine-grained situations happening in our programs and systems, like:
- Has that function in my program been called?
- For a given function, which arguments were passed to it, and what did it return?
- Which TCP packets are being retransmitted?
- Which queries are running slow?
- What insights can we get into programming language events such as GC?
- Has that file been opened?
Imagine a programmable Kubernetes performance analysis tool that runs at cluster level without performance implications: how would you like it to be?

Speakers

Lorenzo Fontana

Open Source Software Engineer, Sysdig

Lorenzo Fontana is an Open Source Software Engineer at Sysdig where he primarily works on Falco. He’s passionate about distributed systems, software defined networking, the Linux kernel and performance analysis.

eBPF Powered Distributed Kubernetes Performance Analysis KUBECON 2018 pdf

Thursday December 13, 2018 1:45pm - 2:20pm PST
4C 3/4

Performance

Skill Level Intermediate
Link to Session Recording https://youtu.be/GkMltybVHkU

2:35pm PST

Encoding 250,000 Songs a Day with batch/v1 Jobs - Leigh Capili & John Slivka, Beatport

Tasked with rebuilding the way we deliver music to DJs, the Beatport Infrastructure team set out to use Kubernetes to construct scalable compute for executing batch and on-demand encoding workloads in order to level up our customers' capabilities for playing and mixing dance music.

What followed was a 5-month journey of building clusters, thrashing with software dependencies, and trudging through erratic performance and scalability issues with the Kubernetes API.

How did we decide to use Kubernetes?
Was it easy to prototype?

Is etcd capable of sustainably servicing 10,000 Jobs an hour?
How many Pods can the Kubernetes API store?
How do you monitor and manage Job failures?

We'll walk you through our lessons learned and talk about our most exciting moments and deflating realizations.
Join us as we retell the story of delivering a correct system to production :)

Speakers

Leigh Capili

Staff Developer Advocate, VMware

Leigh is an empathetic speaker and developer with niches in cloud-native systems and security. He has a background in building software to manage infrastructure. Leigh contributes to Kubernetes and Flux and is frequently working on his next software demo. He also co-maintains Ignite...

John Slivka

Infrastructure Engineer, Beatport

John Slivka works on the Infrastructure team at Beatport. Previously, he's worked for IBM Cloud and Oracle Cloud Object Storage as a Software Engineer.

Encoding 250 000 Songs a Day Dec13 pdf

Thursday December 13, 2018 2:35pm - 3:10pm PST
4C 3/4

Performance

Skill Level Intermediate
Link to Session Recording https://youtu.be/mvrFqRDBgqs

3:40pm PST

Performance Testing Ingress for Internet-Scale Workloads - Alexander Brand, Heptio

Have you ever wondered how much ingress traffic a Kubernetes cluster could handle? How many nodes would it take to handle the traffic of an Alexa top-40 website? Understanding these numbers and how your ingress infrastructure scales is critical when it comes to deploying internet-accessible applications in production.

At Heptio, we needed to prove that our Envoy-based ingress projects, Contour and Gimbal, would scale to support millions of concurrent connections, thousands of backend services, and thousands of virtual hosts.

In this talk, we will explore the strategies and tools we used, the challenges we faced and the lessons we learned while running these tests. We will dive into kernel tuning, HTTP benchmarking, Envoy metrics, and more. We hope that talking about our experience will help when it comes to performance testing your cloud-native applications and infrastructure.

Speakers

Alexander Brand

Systems Software Engineer, Heptio

Alex works at Heptio, helping customers realize all the benefits of Kubernetes and Cloud Native technologies. He is also a maintainer of the Heptio Gimbal project, a software load balancing platform that can route traffic to multiple Kubernetes and OpenStack clusters. He has been...

Performance testing Ingress Alexander Brand pdf

Thursday December 13, 2018 3:40pm - 4:15pm PST
4C 3/4

Performance

Skill Level Intermediate
Link to Session Recording https://youtu.be/eeRDOL2m07M

4:30pm PST

Automated Kubernetes Scalability Testing - Sebastian Jug & Naga Ravi Chaitanya Elluri, Red Hat

Kubernetes supports large clusters according to the docs, but how does it actually scale? Who came up with those limits? What are the actual numbers? To challenge this, we built a CI/CD environment geared towards deploying and testing Kubernetes at scale.

Our stack consists of Kubernetes, OpenStack for IaaS, Jenkins Pipeline, Ansible for automation, pbench (a performance benchmarking and visualization tool), Prometheus, and other open source projects. The stack has pushed the performance and scale limits of Kubernetes with tests focused on the kubelet, the control plane, and cluster density. In this presentation we will explore the story of how we built and tested this stack and the challenges we faced along the way. We will demo the test harness and share the latest performance and scale results. Attendees will learn what the real scalability limits of Kubernetes are, as well as how to scale-test their own infrastructure.

Speakers

Naga Ravi Chaitanya Elluri

Senior Software Engineer, Red Hat

Naga Ravi Chaitanya Elluri is a Senior Software Engineer at Red Hat working on OpenShift Scalability, Reliability, Automation and Tooling to push the limits of OpenShift. His interests lie in the cloud and distributed computing space, and he has contributed to various Open Source Projects...

Sebastian Jug

Senior Software Engineer, Red Hat

Sebastian Jug is a software engineer at Red Hat where he works on OpenShift and container runtime performance. A member of SIG-Scale, he creates tools (cluster loader) and enables large scale performance testing. As keeper of ‘the internet,’ resident of the master branch and FOSS...

Automated Kubernetes Scalability Testing pdf

Thursday December 13, 2018 4:30pm - 5:05pm PST
4C 3/4

Performance

Skill Level Any
Link to Session Recording https://youtu.be/37naDDcmDo4