Ebay has built a GraphDB cloud service called NuGraph, which is based on JanusGraph. FoundationDB is chosen as the JanusGraph’s storage plugin, because of its high-performance and distributed transaction support. Jun Li and Hieu Nguyen from eBay present the GraphDB architecture, and focus on how they deploy and manage FoundationDB in Kubernetes, how they improve JansGraph query performance in a cross-data center environment, how they bulk load the graph into FoundationDB with its transactional support, and how they secure the 3-tier cloud service with limited security support from FoundationDB.
As Lyft migrated its applications to Kubernetes, assumptions baked into the networking layer were tested. This talk discusses how Lyft used Envoy’s xDS protocol to design their own flexible service mesh and handle new challenges from a multi-cluster architecture such as:
- Routing across multiple Kubernetes clusters
- Handling Deployments
- Rapid scale-in and scale-out
- Service Discovery
- Active/Passive Health Checking
- Readiness in the service mesh
This talk will also go over changes that were made in the Envoy codebase to make this work.
This talk highlights the the evolution of Redis support in Envoy. Initially Envoy redis proxy only supported sharding to clusters of independent Redis nodes. Recent developments have enabled support for the open source Redis Cluster protocol as well as some unique features such as multicluster routing, flexible load balancing options, and traffic shadowing.
As the usage of Redis expanded different usage patterns emerged, requiring different availability, durability and consistency trade-offs. Henry Yang from Lyft and Mitch Sulaski from Workday discuss how the Envoy redis proxy was extended to support these new requirements in large scale environment(10+ Millions rps) at Lyft and Workday.
In this talk Anoop Koloth and Hanzhang Wang from eBay present how they managed to build a monitoring system and leveraged data generated from envoy clusters:
(1) Processing billions of hits served from different platforms from worldwide in real-time.
(2) Key Performance Indicators from Envoy ecosystem.
(3) Effective ML solution for proactive monitoring diversified eBay systems.
(4) Graph-based modeling and algorithms to deal with system complexity.
(5) Symbiosis and enhancement with existing SRE solution.
Dylan Carney a Software Engineer at Stripe discusses how Stripe operates compute infrastructure around the globe in order to provide low latency and high availability to its users, using Envoy to connect this infrastructure together. To ensure reliability and operability, Stripe built an Envoy control plane, along with new tooling to manage it. New XDS services provide features like ramp-up of traffic to specific compute clusters, tiered failover, and per-customer routing of API requests. Stripe also built their blue/green deployments on top of this. Finally, they made improvements to make Envoy significantly more reliable in the face of issues like packet loss and head-of-line blocking, based on traffic patterns and behavior observed in production.
Learn how Lyft built, and deployed Envoy Mobile (envoy-mobile.github.io) in their Swift/Kotlin apps and the motivation behind deploying a single, consistent Envoy-based network stack across every platform.