https://go.dok.community/slack https://dok.community/ ABSTRACT OF THE TALK Graph databases are fundamentally designed with a focus on the relationship between data sets. Running stateful workloads like databases on a container platform has always been a challenge. How to unleash the power of your connected data on Kubernetes with cloud-native technologies? Nebula Graph is a highly performant linearly scalable graph database available for use via a shared-nothing distributed model. KubeSphere is ...
Dec 01, 2021•1 hr 2 min•Season 1Ep. 105
ABSTRACT OF THE TALK On-demand infrastructure services have become a key requirement for developers and DevOps engineers to provision and consume the resources they need. While it can be achieved in public cloud with the support of a rich API ecosystem, there's a lack of a common framework across CSPs. As the next-generation Cloud Operating System, Kubernetes provides a variety of services that can be used to reach the same goal, but with a co...
Nov 24, 2021•1 hr 9 min•Season 1Ep. 104
ABSTRACT OF THE TALK Spark and lakeFS are revolutionizing large scale data processing that is version-aware. Is it possible to run this architecture over Kubernetes? We’ll cover the fastest way to get this environment up and running, and the benefits you get with it. Finally, we’ll show how horizontal scaling and the lakeFS Hadoop Filesystem avoid processing bottlenecks as workloads increase. BIO Itai is an R&D team leader at Treeverse, the...
Nov 19, 2021•39 min•Season 1Ep. 103
ABSTRACT OF THE TALK A few short years ago I was a huge opponent of putting any data in Kubernetes; now I'm one of the most vocal proponents of doing so. Listen to how the ecosystem changed my mind. BIO An experienced technologist who has been hands-on in solving data problems for as long as I can remember. Currently working on changing the way we use storage in the datacenter....
Nov 17, 2021•1 hr 2 min•Season 1Ep. 102
ABSTRACT OF THE TALK CAPE is an open source solution offering advanced Kubernetes features for DR, data migration and mobility, multi-cluster app deployment etc. I will demo how to install CAPE and also cover how to back up and restore data using CAPE. KEY TAKE-AWAYS Learn about a new open source tool for Kubernetes backup/restore. BIO I come from a strong infrastructure background with expertise in infrastructure automation, containers, kuberne...
Nov 15, 2021•1 hr 3 min•Season 1Ep. 100
ABSTRACT OF THE TALK How to build a stateful, transactional streaming storage engine on top of Kubernetes. KEY TAKE-AWAYS FROM THE TALK 1. Building storage engines in Kubernetes is hard 2. You need tiered storage (S3, Google Cloud bucket, or Azure Blob) for total disaster recovery 3. You also need sound replication inside the cluster 4. You need a simple architecture so you can scale the pods easily BIO Alex Gallego is the founder and CEO of V...
Nov 12, 2021•1 hr 1 min•Season 1Ep. 101
ABSTRACT OF THE TALK ETL/ELT on Kubernetes is currently an unsolved problem. There are a lot of different approaches vying for a spot as the de facto method, but none are clear winners. Considering that the cloud-native landscape is built for deploying Dockerized, open-source software, many of the closed-source solutions fall flat and don't mesh with the trajectory of the community. Airbyte is an open-source ETL/ELT tool that harmonizes well ...
Nov 05, 2021•1 hr 1 min•Season 1Ep. 99
ABSTRACT OF THE TALK With the help of 3rd parties, running stateful workloads can be a “trivial” experience but what happens when you need to migrate between vendors? Civo offers managed Kubernetes clusters powered by K3s to customers around the world. We manage thousands of Virtual Machines and stateful customer data within multiple data centres across the globe. During recent vendor contract renewal negotiations, it was decided to move to a...
Nov 03, 2021•49 min•Season 1Ep. 98
ABSTRACT OF THE TALK Cassandra is a highly scalable database with an architecture that makes it well suited for multi-region workloads. A Kubernetes cluster often spans across multiple zones within a single region. Multi-region Kubernetes clusters are less common though due to the challenges that they present. This has led to a growing number of multi-cluster solutions. In this presentation John Sanda introduces K8ssandra Operator. It is desi...
Oct 29, 2021•1 hr 2 min•Season 1Ep. 97
ABSTRACT OF THE TALK There is confusion amongst Kubernetes users as to when you should make a Deployment with a PVC and when you should use a StatefulSet with a PVC. There is also a general lack of understanding when it comes to disk access policies, what RWO/RWX mean and what they allow you to do. These concepts are complicated and require a deep level of understanding in order to avoid users making bad decisions that they come to regret lat...
Oct 27, 2021•51 min•Season 1Ep. 96
ABSTRACT OF THE TALK Workloads tell the true story of how Kubernetes is being used in production these days. We talked to 200 architects and DevOps engineers about the nitty gritty deployment details, especially around data-intensive workloads. The result is a report with quantitative information around workload numbers and types, preferred production environments, and what kind of teams are running it all. KEY TAKE-AWAYS Organizations are in...
Oct 08, 2021•1 hr 2 min•Season 1Ep. 95
ABSTRACT OF THE TALK Prima and Tammy join us to discuss the bridges between Security and SRE. How can these two teams work best together? What can they learn from each other? Prima is a Security Engineer and Tammy is a Site Reliability Engineer. They are both Australians living in the USA with 10+ years of experience each working in tech. TALK TAKEAWAYS 1. You'll learn tips for SRE and Security teams to work together 2. You'll learn what SREs c...
Oct 06, 2021•1 hr 2 min•Season 1Ep. 94
ABSTRACT OF THE TALK Discussion around the drivers for DevOps data management and the primary data wait cases in the typical value stream. You’ll discover how data services solutions that enable instant data recovery and data copy will immediately accelerate your CI/CD pipeline. TAKE-AWAYS We’ll discuss: - the specific data wait time cases that could be bogging down your processes - the infrastructure cost-efficiency of a data services platfo...
Oct 05, 2021•47 min•Season 1Ep. 93
ABSTRACT OF THE TALK eBPF is one of the key technologies nowadays. There are several existing technologies in the network and observability fields but not much in the storage space. This presentation tells my research story and tries to define some of the possibilities of the technology. eBPF is a key technology that's found in the market today. It is apparent that there are several existing technologies in both network and observability fields but not ...
Oct 01, 2021•59 min•Season 1Ep. 92
ABSTRACT OF THE TALK This talk will introduce Druid-Operator ( https://github.com/druid-io/druid-operator ) and how Kubernetes and the Operator framework can be used to write an operator that enables provisioning, management, and scaling of a complex cluster of Apache Druid to 1000s of nodes. This talk will cover why Kubernetes and the operator framework are a perfect fit for managing a complex stateful application. Learnings and pitfalls faced wh...
Sep 30, 2021•56 min•Season 1Ep. 91
ABSTRACT OF THE TALK Database Administrators (DBAs) have jobs today because someone recognized that data is critical. It's critical to businesses, who need it to make good decisions. It's critical to users, who need to know how many people liked their recent post. DBAs developed processes and maybe even automation to ensure that this critical data is available and secure, while accessing it is fast. And now someone wants them to put that critic...
Sep 25, 2021•1 hr 11 min
ABSTRACT OF THE TALK You want to build a Kubernetes Operator for your software. Which tool to choose? Operator SDK with Helm, Ansible, or Go? Or maybe start from scratch with Python, Java, or any other programming language? And what is the right phase in the Operator Capability/Maturity Model that you should provide? In my talk I'll present: - Different ways of building Kubernetes Operators - Demo of building the same Operator using different...
Sep 17, 2021•1 hr 3 min•Season 1Ep. 90
ABSTRACT OF THE TALK Azure Arc enabled Data Services allows organizations to run the Azure dialect of database-as-a-service on Kubernetes on virtually any cloud or infrastructure. This session will cover the architecture of this new product, how it is deployed, the dialects of database engine that it currently supports and the storage considerations for deploying this service. BIO Chris has been working with databases for over twenty years in...
Sep 16, 2021•59 min•Season 1Ep. 89
ABSTRACT OF THE TALK The National Security Agency (NSA) and the Cybersecurity and Infrastructure Security Agency (CISA) have published comprehensive recommendations for strengthening the security of an organization’s Kubernetes system to help companies make their Kubernetes environment more difficult to compromise. One of the key areas this report covers is how to configure K8s to prevent data breaches. In this session, Leonid Sandler, ARMO C...
Sep 15, 2021•1 hr 11 min•Season 1Ep. 88
ABSTRACT OF THE TALK Building a gateway API architecture is not an easy task. In this presentation Leonardo will talk about how the microgateway strategy can be used to resiliently and reliably expose APIs in a cloud native environment while increasing the productivity of engineering teams. BIO Experienced Senior Solutions Architect and DevSecOps Engineer with a demonstrated history of working in the information technology and services indust...
Sep 13, 2021•1 hr 4 min•Season 1Ep. 87
ABSTRACT OF THE TALK What do companies like Apple, Netflix, Huawei or Instagram have in common? Aside from billions of requests, petabytes of data, and millions of users around the world, all of these companies use Apache Cassandra™ - the NoSQL decentralised database, well known by data experts for its scaling capabilities, high availability and incredible performance. Join us for the Introduction to Apache Cassandra workshop to learn the fun...
Sep 13, 2021•2 hr 3 min
ABSTRACT OF THE TALK This session will talk about the considerations and tools available when it comes to performance and design of Kubernetes running in a virtualised environment. BIO Working at VMware as a Cloud Management specialist focusing on automation and cloud native technologies. KEY TAKE-AWAYS FROM THE TALK From this session you will learn what tools are readily available today to help you benchmark your platforms....
Sep 09, 2021•1 hr 4 min•Season 1Ep. 86
ABSTRACT OF THE TALK Randy will discuss the increasing importance of DataOps to the modern analytics-driven enterprise and how it integrates with continuous development and operations environments. He’ll illustrate how Kubernetes enables DataOps across four common patterns used for continuous data integration: batch, streaming, microservice and ETL solutions. Randy will also explain how these design patterns – in concert with automation, CI/C...
Sep 09, 2021•57 min•Season 1Ep. 83
ABSTRACT OF THE TALK We've all heard about how Kubernetes is self-healing. However, what happens if you lose your Kubernetes cluster and your data? Is there a way to recover that? This talk will explore the why and how of backing up and restoring a Kubernetes cluster. The talk will focus on three scenarios for backup and restore: - Kubernetes control plane - Applications - Data A comparison of different techniques will be presented with some live ...
Sep 08, 2021•1 hr 30 min•Season 1Ep. 85
ABSTRACT OF THE TALK Kubernetes is a platform for building platforms and is not really developer-friendly. That's why Red Kubes created Otomi, the first and only Kubernetes-based operating system. Install Otomi with a single Helm Chart, log in, and start deploying applications with only a few clicks. Use Otomi to build your Enterprise Container Platform or Internal Developer Platform in minutes. In this talk, I'm going to introduce Otomi. We'l...
Sep 03, 2021•1 hr 7 min•Season 1Ep. 84
ABSTRACT OF THE TALK A smart person learns from their own mistakes, but a truly wise person learns from the mistakes of others. When launching our product, we wanted to learn as much as possible about typical pains in our ecosystem, and did so by reviewing many post-mortems (100+!) to discover the recurring patterns, anti-patterns, and root causes of typical outages in Kubernetes-based systems. In this talk we have aggregated for you the insi...
Sep 01, 2021•52 min•Season 1Ep. 82
ABSTRACT OF THE TALK Join this session to learn: - why reliability is a key feature of your service - why SLOs are the best way to measure it and set reliability targets - how error budgets increase your velocity - what an error budget burn rate is - how to set up alerts using burn rates to optimize your response and operational load BIO Yuri works with Google Cloud Platform customers to help them design, architect, build, and operate reliable...
Aug 27, 2021•56 min•Season 1Ep. 81
ABSTRACT OF THE TALK Keptn is an event-driven orchestration platform with SRE best practices (declarative pipelines, quality gates and automated remediation) available "out of the box". This talk will be an introduction to Keptn, the core use cases, and Keptn in the wider IT ecosystem. BIO Brad started his career as a Software Engineer at TripAdvisor and then got into DevOps and moved into Fintech where he led the Cloud Engineering Innovation te...
Aug 27, 2021•1 hr 3 min•Season 1Ep. 80
ABSTRACT OF THE TALK When I founded Kesque in 2019, my goal was to use Kubernetes as the base platform for all our software. Because Kesque was a cloud-based SaaS powered by Apache Pulsar, that meant getting Apache Pulsar, a high-performance streaming solution and Kafka alternative, up and running in Kubernetes. In this talk, I will give an overview of Apache Pulsar and describe how we got Pulsar up and running in Kubernetes. We will cover some o...
Aug 26, 2021•1 hr 7 min•Season 1Ep. 79
ABSTRACT OF THE TALK Volcano is a system for running high-performance workloads on Kubernetes. It features powerful batch scheduling capability that Kubernetes cannot provide but is commonly required by many classes of high-performance workloads, including Machine Learning, Deep Learning, Big Data, Bioinformatics Computing, etc. These types of workloads typically run on generalized domain frameworks like TensorFlow, Spark, PyTorch, MPI, etc. In this talk, Kevin will introduce how Volcano is inte...
Aug 26, 2021•1 hr 10 min•Season 1Ep. 78