Kubernetes Bytes

Ryan Wallner & Bhavin Shah

Kubernetes Bytes is a podcast bringing you the latest from the world of cloud native data management. Hosts Ryan Wallner and Bhavin Shah come to you from Boston, Massachusetts with experienced backgrounds in cloud-native tech. They'll be sharing their thoughts on recent cloud native news and talking to industry experts about their experiences and challenges managing the wealth of data in today's cloud-native ecosystem. read less
TechnologyTechnology

Episodes

KubeCon NA 2024 News Recap
18-12-2024
KubeCon NA 2024 News Recap
Join Bhavin Shah and Ryan Wallner for a recap of announcements and news from KubeCon North America 2024.  Check out our website at https://kubernetesbytes.com/ https://www.businesswire.com/news/home/20241119538933/en/Spectro-Cloud-Closes-75m-Series-C-Led-by-Growth-Equity-at-Goldman-Sachs-Alternativeshttps://northflank.com/blog/northflank-raises-22m-to-make-kubernetes-work-for-your-developers-ship-workloads-not-infrastructurehttps://snyk.io/news/snyk-acquires-developer-first-dast-provider-probely/https://www.nutanix.com/blog/introducing-nutanix-enterprise-ai https://thenewstack.io/stacklok-donates-minder-security-project-to-openssf/https://www.prnewswire.com/news-releases/akamai-launches-cloud-agnostic-ready-to-run-application-platform-302302446.html https://www.businesswire.com/news/home/20241112064093/en/Loft-Labs-Introduces-vCluster-Cloud-a-Managed-Solution-to-Simplify-and-Reduce-Costs-of-Kuberneteshttps://cloud.google.com/blog/products/containers-kubernetes/gke-65k-nodes-and-counting https://www.veeam.com/resources/wp-whats-new-veeam-kasten-for-kubernetes.htmlhttps://www.netapp.com/blog/trident-24-10-best-storage-kubernetes/https://podman-desktop.io/blog/2024/11/14/podman-desktop-cncf https://aws.amazon.com/blogs/aws/streamline-kubernetes-cluster-management-with-new-amazon-eks-auto-mode/https://aws.amazon.com/about-aws/whats-new/2024/12/amazon-eks-hybrid-nodes/https://thenewstack.io/kueue-can-now-schedule-kubernetes-batch-jobs-across-clusters/ https://cloudnativenow.com/kubecon-cnc-na-2024/solo-io-donates-api-gateway-to-cncf-to-advance-kubernetes-connectivity/https://blocksandfiles.com/2024/11/08/tintri-unveils-kubernetes-container-storage-interface-for-streamlined-management/ https://www.sdxcentral.com/articles/stringerai-announcements/kubiya-launches-captain-kubernetes-ai-tool-for-simplifying-kubernetes-management/https://cloudnativenow.com/topics/cloudnativedevelopment/application-dev/cncf-automates-kubernetes-secops-with-kyverno/https://www.cncf.io/announcements/2024/11/15/cloud-native-computing-foundation-expands-certification-to-platform-engineering-and-more/  https://training.linuxfoundation.org/platform-engineering-programs https://cloudnativenow.com/topics/cloudnativedevelopment/cncf-kubevirt-v1-4-vms-are-now-just-another-kubernetes-resource/https://kubevirt.io/user-guide/release_notes/kubernetes.io/blog/2024/12/11/kubernetes-v1-32-release/https://thehackernews.com/2024/12/296000-prometheus-instances-exposed.html?utm_source=tldrdevops
Inference in Action: Scaling Al Smarter with Inferless
24-10-2024
Inference in Action: Scaling Al Smarter with Inferless
In this episode, we sit down with Nilesh Agarwal, co-founder of Inferless, a platform designed to streamline serverless GPU inference. We’ll cover the evolving landscape of model deployment, explore open-source tools like KServe and Knative, and discuss how Inferless solves common bottlenecks, such as cold starts and scaling issues. We also take a closer look at real-world examples like CleanLab, who saved 90% on GPU costs using Inferless.Whether you’re a developer, DevOps engineer, or tech enthusiast curious about the latest in AI infrastructure, this podcast offers insights into Kubernetes-based model deployment, efficient updates, and the future of serverless ML. Tune in to hear Nilesh's journey from Amazon to founding Inferless and how his platform is transforming the way companies deploy machine learning models.Subscribe now for more episodes! Show Links:OpenShift 4.17 is GA https://www.youtube.com/live/DvKHwz-c11c?si=6Zap6hk_GsQfdX2mPolicy SBOM from Styra: https://www.styra.com/blog/introducing-policy-sbom/NVIDIA GEForce NOW runs on KubeVirt https://thenewstack.io/now-nvidia-scaled-its-cloud-services-with-kubevirt/CBT feedback https://thenewstack.io/kubernetes-advances-cloud-native-data-protection-share-feedbackCNCF KUBEEDGE Grad https://www.devopsdigest.com/cncf-announces-kubeedge-graduation?utm_source=tldrdevopsPalumi Operator 2.0 https://www.pulumi.com/blog/pulumi-kubernetes-operator-2-0Inferless LInks:https://www.inferless.com/blog/cleanlab-saves-90-on-gpu-costs-with-inferless-serverless-inferencehttps://www.inferless.com/blog/how-spoofsense-scaled-their-ai-inference-with-inferless-dynamic-batching-autoscalinghttps://www.inferless.com/ https://docs.inferless.com/introduction/introduction LinkedIn - https://www.linkedin.com/in/nilesh-agarwal/ X- https://x.com/nilesh_agarwal2 Medium Blog https://nilesh-agarwal.medium.com/
Running Ray on Kubernetes with KubeRay
05-09-2024
Running Ray on Kubernetes with KubeRay
In this episode of the Kubernetes Bytes podcast, Bhavin sits down with Kai-Hsun Chen, Software Engineer at Anyscale and maintainer of the KubeRay project. The discussion focuses on how the open source Ray project can help organizations use a single tool for data prep, model training, fine tuning and model serving workflows, both for their predictive AI and generative AI models. The discussion also dives into the KubeRay project and how it provides three different Kubernetes CRDs for Data Scientists to deploy Ray clusters on demand.   Check out our website at https://kubernetesbytes.com/  Cloud Native News:https://azure.github.io/AKS/2024/08/23/fine-tuning-language-models-with-kaitohttps://orca.security/resources/blog/kubernetes-testing-environment/https://www.redhat.com/en/about/press-releases/red-hat-openstack-services-openshift-now-generally-available  Show links:Kai's LinkedIn: https://www.linkedin.com/in/kaihsun1996/KubeRay doc: https://docs.ray.io/en/latest/cluster/kubernetes/index.htmlRay Summit registration: https://raysummit.anyscale.com/flow/anyscale/raysummit2024/reg/createaccount (code: KaiHsunC15)KubeRay repository: https://github.com/ray-project/kuberayRay repository: https://github.com/ray-project/rayRay Slack workspace: https://docs.google.com/forms/d/e/1FAIpQLSfAcoiLCHOguOm8e7Jnn-JJdZaCxPGjgVCvFijHB5PLaQLeig/viewform  Timestamps: 00:02:40 Cloud Native News 00:07:20 Interview with Kai 00:49:15 Key takeaways
Deploy and fine-tune LLM models on Kubernetes using KAITO
07-08-2024
Deploy and fine-tune LLM models on Kubernetes using KAITO
In this episode of the Kubernetes Bytes podcast, Bhavin sits down with  Sachi Desai, Product Manager and Paul Yu, Sr. Cloud Advocate at Microsoft to talk about the open source KAITO project. KAITO is the Kubernetes AI Toolchain Operator that enables AKS users to deploy open source LLM models on their Kubernetes clusters. They discuss how KAITO helps with running AI-enabled applications alongside the LLM models, how it helps users bring their own LLM models and run them as containers, and how KAITO helps them fine-tune open source LLMs on their Kubernetes clusters.  Check out our website at https://kubernetesbytes.com/  Cloud Native News:  https://azure.github.io/AKS/2024/07/30/azure-container-storage-gahttps://github.blog/news-insights/product-news/introducing-github-models/  Show links: Azure/kaito: Kubernetes AI Toolchain Operator - https://github.com/Azure/kaito/tree/mainhttps://www.youtube.com/watch?v=3cGmHDjR_3I&list=PLc3Ep462vVYtgN4rP1ThTJd2UlsBc2sou&index=2https://aka.ms/cloudnative/learnlive/intelligent-apps-on-aks/episode-2Jumpstart AI Workflows With Kubernetes AI Toolchain Operator - The New Stack - https://thenewstack.io/jumpstart-ai-workflows-with-kubernetes-ai-toolchain-operatorhttps://paulyu.dev/article/soaring-with-kaito/Concepts - Fine-tuning language models for AI and machine learning workflows - Azure Kubernetes Service | Microsoft Learn - https://learn.microsoft.com/en-us/azure/aks/concepts-fine-tune-language-models  Keep up to date on the most recent announcements by following some of the KAITO engineers on LinkedIn: Fei Guo - https://www.linkedin.com/in/fei-guo-a48319a/Ishaan Sehgal - https://www.linkedin.com/in/ishaan-sehgal/ Timestamps: 00:02:15 Cloud Native News 00:05:34 Interview with Sachi and Paul 00:42:08 Key takeaways
The evolution of service mesh technologies
17-05-2024
The evolution of service mesh technologies
In this episode of the Kubernetes Bytes podcast, Ryan and Bhavin talk to Christian Posta - VP and Global Field CTO at Solo.io about all things Service Mesh. They discuss how things have evolved from the early Linkerd days to sidecar less istio service mesh implementations. They also talk about how service mesh can help you connect to application components running outside Kubernetes, and how developers and platform engineers have a shared responsibility model when it comes to implementing service mesh using internal developer platforms.   Check out our website at https://kubernetesbytes.com/  Episode Sponsor: Nethopper Learn more about KAOPS:  @nethopper.io For a supported-demo:  info@nethopper.io Try the free version of KAOPS now!   https://mynethopper.com/auth  Cloud Native News:  https://loft.sh/blog/our-24m-series-a-led-by-khosla-ventures/https://www.harness.io/blog/celebrating-150m-in-new-financing-to-accelerate-innovationhttps://www.akamai.com/newsroom/press-release/akamai-announces-intent-to-acquire-api-security-company-nonamehttps://www.linkedin.com/posts/rouvenbesters_its-official-the-otomi-platform-has-activity-7194604616901120000-48g7?utm_source=share&utm_medium=member_desktophttps://www.wiz.io/blog/celebrating-our-1-billion-funding-round-and-12-billion-valuation Show Links: https://devsummit.infoq.com/conference/boston2024 https://www.solo.io/topics/cakes-stack/https://www.solo.io/  Timestamps: 00:06:10 Cloud Native News 00:15:37 Interview with Torsten 01:01:58 Key takeaways
What are Vector Databases
06-05-2024
What are Vector Databases
In this episode of the Kubernetes Bytes podcast, Ryan and Bhavin talk to Torsten Steinbach - VP, Chief Architect for Analytics & AI at EDB about all things Vector Databases, Postgres, and why Data is important for building AI platforms. The discussion dives into how vector databases are different than relational databases and why using Postgres extensions helps organizations use their existing data for AI applications.  Check out our website at https://kubernetesbytes.com/  Kubernetes Community Days (KCD) in New York City on May 22nd, use the promo code “KUBERNETESBYTES” to get a 10% discount on your registration fees!   Episode Sponsor: Nethopper Learn more about KAOPS:  @nethopper.io For a supported-demo:  info@nethopper.io Try the free version of KAOPS now!   https://mynethopper.com/auth Cloud Native News:  https://www.reuters.com/markets/deals/ibm-nearing-buyout-deal-hashicorp-wsj-reports-2024-04-23/https://www.wiz.io/blog/wiz-acquires-gem-security-to-reinvent-threat-detection-in-the-cloudhttps://techcrunch.com/2024/04/18/wiz-is-in-talks-to-buy-lacework-for-150-200m-security-firm-was-last-valued-at-8-3b/https://www.prnewswire.com/news-releases/coreweave-secures-1-1-billion-in-series-c-funding-to-drive-the-next-generation-of-cloud-computing-for-the-future-of-ai-302133328.htmlhttps://kubernetes.io/blog/2024/04/17/kubernetes-v1-30-release https://dok.community/blog/become-a-data-on-kubernetes-in-2024-ambassador/   Show Links: https://www.enterprisedb.com/news/edb-acquires-splitgraph https://www.enterprisedb.com/resources/eventshttps://www.enterprisedb.com/  Timestamps: 00:03:22 Cloud Native News 00:17:45 Interview with Torsten 00:57:00 Key takeaways
KubeCon EU Paris News Recap
16-04-2024
KubeCon EU Paris News Recap
Join Bhavin Shah and Ryan Wallner for a recap of announcments and news from KubeCon Paris 2024.Kubernetes Community Days (KCD) in New York City on May 22nd, use the promo code “KUBERNETESBYTES” to get a 10% discount on your registration fees!NethopperLearn more about KAOPS:  @nethopper.io For a supported-demo:  info@nethopper.ioTry the free version of KAOPS now!   https://mynethopper.com/authNewshttps://about.gitlab.com/blog/2024/03/20/oxeye-joins-gitlab-to-advance-application-security-capabilities/https://www.redhat.com/en/blog/unveiling-red-hat-openshift-415https://developer.nvidia.com/blog/nvidia-nim-offers-optimized-inference-microservices-for-deploying-ai-models-at-scale/https://www.acorn.io/resources/blog/our-new-focus-developing-an-llm-app-platform-based-on-gpt-script-technology?fromOther=truehttps://loft.sh/blog/deliver-secure-kubernetes-multi-tenancy-with-new-vcluster-in-rancher-integration/https://www.observeinc.com/blog/stepping-on-the-gas/https://thenewstack.io/kubecost-2-2-covers-carbon-cost-monitoring-and-more/https://thenewstack.io/ovhcloud-unveils-roadmap-to-take-on-hyperscalers-from-europe/https://www.suse.com/c/meet-rancher-prime-3-0/https://www.suse.com/c/suse-releases-edge-3-0-highly-validated-edge-optimized-stack/https://www.fermyon.com/blog/introducing-spinkube-fermyon-platform-for-k8s https://www.cncf.io/blog/2024/03/19/announcing-the-ai-working-groups-new-cloud-native-artificial-intelligence-whitepaper/ https://github.com/Azure/kaito https://azure.microsoft.com/en-us/updates/public-preview-kubernetes-ai-toolchain-operator-kaito-addon-for-aks/https://cloudnativenow.com/features/solo-io-delivers-on-cilium-support-promise-for-gloo-networks/ https://docs.solo.io/gloo-network/latest/about/overview/ https://github.com/kosmos-io/kosmos https://gateway.envoyproxy.io/blog/2024/03/14/announcing-envoy-gateways-1.0-release/ https://newrelic.com/press-release/20240319 https://siliconangle.com/2024/03/29/aviatrix-revolutionizes-networking-security-distributed-cloud-firewall-kubernetes-kubeconeu/
Generative AI on Kubernetes
12-03-2024
Generative AI on Kubernetes
In this episode of the Kubernetes Bytes podcast, Ryan and Bhavin sit down with Janakiram MSV - an advisor, analyst and architect to talk about how users can run Generative AI models on Kubernetes. The discussion revolves around Jani's home lab and his experimentation with different LLM models and how to get them running on NVIDIA GPUs. Jani has spent the past year becoming a subject matter expert in GenAI, and this discussion highlights all the different challenges he faced and what lessons he learnt from them.   Check out our website at https://kubernetesbytes.com/  Episode Sponsor: Elotl  https://elotl.co/lunahttps://www.elotl.co/luna-free-trial  Timestamps: 02:02 Cloud Native News 15:31 Interview with Jani 01:11:00 Key takeaways  Cloud Native News: https://www.techerati.com/press-release/octopus-deploy-acquires-codefresh-to-boost-kubernetes-and-cloud-native-delivery/https://www.civo.com/blog/kubefirst-joins-civo  https://cast.ai/kubernetes-cost-benchmark https://www.techradar.com/pro/vmware-customers-are-jumping-ship-as-broadcom-sales-continue-heres-where-theyre-moving-to https://cloudonair.withgoogle.com/events/techbyte-making-ai-ml-scalable-cost-effective-gkehttps://dok.community/dok-events/dok-day-kubecon-paris/ https://training.linuxfoundation.org/certification/certified-argo-project-associate-capa   Show Links: https://www.youtube.com/janakirammsv https://www.linkedin.com/in/janakiramm/- NVIDIA Container Toolkit - https://docs.nvidia.com/datacenter/cloud-native/container-toolkit/latest/index.htmlNVIDIA Device Plugin - https://github.com/NVIDIA/k8s-device-pluginNVIDIA Feature Discovery - https://github.com/NVIDIA/gpu-feature-discoveryHugging Face Text Gen Inference - https://huggingface.co/docs/text-generation-inference/indexHugging Face Text Embeddings Inference - https://huggingface.co/docs/text-embeddings-inference/indexChromaDB - https://www.trychroma.com/