Open||Source||Data - podcast cover

Open||Source||Data

Charna Parkeywww.datastax.com
What can we learn from ai-native development through stimulating conversations with developers, regulators, academics and people like you that drive forward development, seek to understand impact, and are working to mitigate risk in this new world? Join Charna Parkey and the community shaping the future of open source data, open source software, data in AI, and much more.
Last refreshed:
Follow this podcast in the Metacast mobile app to refresh it and see new episodes.
Download Metacast podcast app
Podcasts are better in Metacast mobile app
Don't just listen to podcasts. Learn from them with transcripts, summaries, and chapters for every episode. Skim, search, and bookmark insights. Learn more

Episodes

Throwback: Open Source Innovation, The GPL for Data, and The Data In to Data Out Ratio with Larry Augustin

This episode features an interview with Larry Augustin, angel investor and advisor to early-stage technology companies. Larry previously served as the Vice President for Applications at AWS, where he was responsible for application services like Pinpoint, Chime, and WorkSpaces. Before joining AWS, Larry was the CEO of SugarCRM, an open source CRM vendor. He also was the founder and CEO of VA Linux, where he launched SourceForge. Among the group who coined the term “open source”, Larry has sat on...

Oct 18, 202341 minSeason 5Ep. 9

Reframing Machine Learning and AI-Assisted Development with Jorge Torres

This episode features an interview with Jorge Torres, Co-founder and CEO of MindsDB. MindsDB is a virtual AI database that works with existing data to help developers build AI-centered apps. In 2008, Jorge began his work on scaling solutions using machine learning as the first full-time engineer at Couchsurfing, growing the company from a few thousand users to a few million. He has also served a number of data-intensive start-ups and was a visiting scholar at UC Berkeley researching machine lear...

Sep 27, 202345 minSeason 5Ep. 8

A Sam Ramji Feature: The Evolution of Open Source, Kubernetes, and AI's Forward Journey

On this episode, we’ve partnered with the Future Rodeo podcast for a discussion between Sam and Matt Wallace. Matt is the Chief Technology Officer and EVP at Faction, a pioneer of multi-cloud data services, and host of Future Rodeo. In this episode, Sam and Matt discuss Microsoft’s transformation, the impact of Kubernetes on container orchestration, and the rapid acceleration of AI research and development. ------------------- Episode Timestamps: (01:38): Microsoft’s open source transformation (...

Sep 06, 20231 hr 10 min

The Importance of Open Source Data for Generative AI, Now and in the Future with Abby Kearns

This episode features an interview with Abby Kearns, technology executive, board director, and angel investor. Her career has spanned executive leadership, product marketing, product management, and consulting across Fortune 500 companies and startups, including Puppet, Cloud Foundry Foundation, and Verizon. Abby currently serves as a board director for Lightbend, Stackpath, and Invoke. In this episode, Sam sits down with Abby to discuss the betrayal source license, the role open source plays in...

Aug 23, 202346 minSeason 5Ep. 7

The Value of Reproducibility and Ease of AI Deployment with Daniel Lenton

This episode features an interview with Daniel Lenton, Founder and CEO of Ivy, where the team is on a mission to unify the fragmented AI stack. Prior to Ivy, Daniel was a Robotics Research Engineer at Dyson and a Deep Learning Research Scientist for Amazon Prime Air. During his PhD, Daniel explored the intersection between learning-based geometric representations, ego-centric perception, spatial memory, and visuomotor control for robotics. In this episode, Sam and Daniel discuss the inspiration ...

Aug 09, 202334 minSeason 5Ep. 6

ML Engineering Teams and Niche Chat Bot Experiences with Demetrios Brinkmann

This episode features an interview with Demetrios Brinkmann, Founder of the MLOps Community, an organization for people to share best practices around MLOps. Demetrios fell into the Machine Learning Operations world and has since interviewed leading names around MLOps, data science, and machine learning. In this episode, Sam sits down with Demetrios to discuss LLM in production use cases, ML engineering teams, and the LLM Survey Report from the MLOps Community. ------------------- "I think the m...

Jul 26, 202350 minSeason 5Ep. 5

Building With Trust, Inspiration, and Reputation with Jaya Gupta, Yuliia Tkachova, and Omoju Miller

This bonus episode features conversations from season 5 of the Open||Source||Data podcast. In this episode, you’ll hear from Jaya Gupta, Partner at Foundation Capital; Yuliia Tkachova, Co-founder and CEO of Masthead Data; and Omoju Miller, Founder and CEO of Fimio. Sam sat down with each guest to discuss how they are building foundations for trust, inspiration, and reputation as we all race into the AI-centric future. You can listen to the full episodes from Jaya Gupta, Yuliia Tkachova, and Omoj...

Jul 12, 20234 min

FMOps and a Founders Automated Future with Jaya Gupta

This episode features an interview with Jaya Gupta, Partner at Foundation Capital, where she leads early-stage investments across the enterprise software stack. Previously, Jaya was a Senior Business Analyst at McKinsey & Company focusing on software diligence and helping startups expand their go-to-market strategies. In this episode, Sam and Jaya discuss her journey to Foundation Model Ops, how software is becoming more accessible, and the democratization of AI tools. ------------------- "A...

Jun 28, 202334 minSeason 5Ep. 4

Web3 and Putting Reputation on Code with ML with Omoju Miller

This episode features an interview with Omoju Miller, Founder and CEO of Fimio, a web3 reputation company. Originally from Lagos, Nigeria, Omoju holds a doctoral degree in Computer Science Education from UC Berkeley. Her expertise in machine learning and computational intelligence led her to companies such as Google and GitHub. Omoju also served as a volunteer advisor to the Obama administration’s White House Presidential Innovation Fellows. In this episode, Sam sits down with Omoju to discuss h...

May 31, 20231 hr 2 minSeason 5Ep. 3

The Human Right to Privacy and Caring About UX Design with Yuliia Tkachova

This episode features an interview with Yullia Tkachova, Co-founder and CEO of Masthead Data, an observability platform that catches anomalies in Google BigQuery in real-time. She holds degrees in Management Information Systems, Math, Statistics, and Marketing. Prior to Masthead, Yuliia designed complex BI products and solutions powered by ML and utilized by Fortune 500 companies. In this episode, Sam and Yuliia discuss how ML is shaping the future of data analytics, caring about users, and the ...

May 17, 202347 minSeason 5Ep. 2

Determinism in Complex Environments and Workflow Services with Maxim Fateev

This episode features an interview with Maxim Fateev, Co-founder and CEO of Temporal, an open source, distributed, and scalable workflow orchestration engine capable of running millions of workflows. He has 20 years of experience architecting mission-critical systems at Uber, Google, Amazon, and Microsoft. In this episode, Sam sits down with Maxim to discuss workflow services, the power behind Temporal, and bringing determinism to highly complex environments. ------------------- “[Temporal] has ...

May 03, 202342 minSeason 5Ep. 1

The AI-Native Stack in Practice with Charna Parkey and Sam Bean

This episode features a panel discussion with Charna Parkey, a Real-Time AI Product and Strategy leader at DataStax; and Sam Bean, Staff Engineer at You.com. Charna is a co-author and inventor on several patents, including patent-pending work on ML/coordinated feature engine at the edge. Sam helped create the Spark connector to Weaviate, and is passionate about Big Data, Spark, NLP, Hugging Face, and large language models. In this episode, Charna and Sam discuss adapting to user expectations, wh...

Mar 15, 20231 hr 6 minSeason 4Ep. 13

The AI-Native Stack with Mikiko Bazeley, Zain Hasan, and Tuana Celik

This episode features a panel discussion with Mikiko Bazeley, Head of MLOps at Featureform; Zain Hasan, Senior Developer Advocate at Weaviate; and Tuana Celik, Developer Advocate at deepset. In this episode, Mikiko, Zain, and Tuana discuss what open source data means to them, how their companies fit into the AI-first ecosystem, and how jobs will need to evolve with the AI-native stack. ------------------- “We're almost part of a fancy new AI robot kitchen that you'd find in Tokyo, in some ways. ...

Mar 01, 202357 minSeason 4Ep. 12

Special Episode: Data on Kubernetes and Cassandra Forward with Patrick McFadin

This special episode of Open||Source||Data features an interview with Patrick McFadin. Patrick has been a distributed systems hacker since he first plugged a modem into his Atari computer. Looking for adventure, he joined the US Navy, working on the Naval Tactical Data System (NTDS), which cemented his love of distributed systems. He is now an Apache Cassandra Committer, and is the Vice President of Developer Relations at DataStax. Sam catches up with Patrick at Data Day Texas to discuss his boo...

Feb 22, 202319 min

Making Graph Data Easier with Open Initiatives with Denise Gosnell

This episode features an interview with Denise Gosnell, Principal Product Manager at Amazon Web Services. At AWS, Denise leads product and strategy for Amazon Neptune, a fully managed graph database service. Her career centers on her passion for examining, applying, and advocating for the applications of graph data. Denise has also authored, patented, and spoken on graph theory, algorithms, databases, and applications across all industry verticals. In this episode, Sam sits down with Denise to d...

Feb 15, 202340 minSeason 4Ep. 11

Advising Big Data and The Future of AI/ML with Ben Lorica

This episode features an interview with Ben Lorica, Co-founder and Principal of Gradient Flow, a company that provides a wide range of content on data and technology. Ben is an industry expert on data, machine learning, and AI. He is a Technical Advisor for Databricks, a program chair for several data conferences, and he hosts The Data Exchange Podcast. In this episode, Sam and Ben discuss Big Data and the improvements and future opportunities of AI and machine learning. ------------------- “The...

Feb 01, 202348 minSeason 4Ep. 10

Functional Programming and an Ideal Data Stack Building Experience with Holden Karau

This episode features an interview with Holden Karau, an Open Source Engineer at Netflix. Holden is best known for her work on Apache Spark, her advocacy in the open source software movement, and her creation of a variety of related projects including spark-testing-base. Previously, Holden worked at Big Tech companies like Apple, IBM, and Google as a software engineer and developer advocate. In this episode, Sam sits down with Holden to discuss the data analysis stack, functional programming, an...

Jan 18, 202345 minSeason 4Ep. 9

Workflow Engines and Building a Domain Specific Language for Data Quality with Tom Baeyens

This episode features an interview with Tom Baeyens, Co-founder and CTO of Soda, where he oversees the company's product development, software architecture, and technology strategy. He is passionate about open source and committed to building a community where data engineers can succeed using the Soda Data Monitoring Platform. Tom is the inventor of the widely-used open source project jBPM and Activiti. He also co-founded Effektif, a cloud process automation company. In this episode, Sam and Tom...

Jan 04, 202334 minSeason 4Ep. 8

Enabling Edge Workers, AI & ML, and The Future of Data Science with Matthew Rocklin

This episode features an interview with Matthew Rocklin, CEO of Coiled, the scalable Dask-based cloud platform. Prior to founding Coiled, Matthew worked on Dask at Anaconda and then NVIDIA where his teams focused on accelerating Dask through parallel computing and GPUs. Matthew is an industry speaker, author, and founding member of Pangeo, whose mission is to develop open source analysis tools for ocean, atmosphere, and climate science. In this episode, Sam sits down with Matthew to discuss enab...

Dec 14, 202244 minSeason 4Ep. 7

OSPOs, Measuring Community Success, and Self Knowledge with Nithya Ruff

This episode features an interview with Nithya Ruff, Head of Open Source Program Office at Amazon. At Amazon, she drives open source culture and coordination and engagement with external communities. Prior to Amazon, Nithya spearheaded and grew Open Source Program Offices (OSPOs) for Comcast and Western Digital. She has also served as the Director-At-Large on the Linux Foundation Board since 2016, where she works to advance the mission of building sustainable ecosystems that are built on open co...

Dec 07, 202235 minSeason 4Ep. 6

IoT Databases, Digital Twins, and Real Holodecks with Jonathan Beri

This episode features an interview with Jonathan Beri, Founder & CEO of Golioth, a commercial IoT development platform built for scale. Previously, Jonathan was a Product Manager at Particle, Google/Nest, Magneto, and Myspace where he spent his time building IoT solutions. In this episode, Sam sits down with Jonathan to discuss the concept of digital twins, the future of IoT databases, and how to build a real holodeck. ------------------- “I think about IoT when I started at Nest, we had som...

Nov 23, 202237 minSeason 4Ep. 5

Healthcare Infrastructure, ALS Research and Reliable Data with Indu Navar

This episode features an interview with Indu Navar, CEO and Founder of EverythingALS, a patient-driven non-profit, bringing technological innovations and data science to support efforts from care to cure, for people with ALS. Indu’s impressive career includes being an original member of the WebMD engineering team, where she was instrumental in using emerging technologies to achieve application scalability and performance. In this episode, Sam sits down with Indu to discuss healthcare infrastruct...

Nov 09, 202246 minSeason 4Ep. 4

Shifting Left on Data with DeVaris Brown, Tomer Shiran, and Erica Brescia

This bonus episode features conversations from season 3 of the Open||Source||Data podcast. In this episode, you’ll hear from DeVaris Brown, CEO & Co-founder of Meroxa; Tomer Shiran, Founder & CPO of Dremio; and Erica Brescia, Managing Director at Redpoint Ventures. Sam sat down with each guest to discuss how they’re making data more programmable by shifting left. You can listen to the full episodes from DeVaris Brown, Tomer Shiran, and Erica Brescia by clicking the links below. ---------...

Nov 02, 20223 min

Serial Entrepreneurship, Metadata Capture Systems, and Osquery with Tony Gauda

This episode features an interview with Tony Gauda, Head of Customer Engineering at Fleet Device Management, an open core company powered by Osquery. Tony is a serial entrepreneur and inventor with a profound history in fraud, security, and SaaS business. He holds several issued patents and his companies have raised over $40 million in venture funding. Tony is also the founder of ThinAir, a Y-Combinator backed SaaS service that tackles the insider threat problem for enterprises and government ag...

Oct 26, 202234 minSeason 4Ep. 3

Code Intelligence, GraphQL, and Closing the Remediation Gap with Beyang Liu

This episode features an interview with Beyang Liu, CTO and Co-founder of Sourcegraph, a code intelligence platform. Prior to Sourcegraph, Beyang was a software engineer at Palantir Technologies, where he developed new data analysis software on a customer-facing team working with Fortune 500 companies. Beyang studied Computer Science at Stanford, where he published research in probabilistic graphical models and computer vision at the Stanford AI Lab. In this episode, Sam sits down with Beyang to...

Oct 12, 202235 minSeason 4Ep. 2

Stream Processing, Observability, and the User Experience with Eric Sammer

This episode features an interview with Eric Sammer, CEO of Decodable. Eric has been in the tech industry for over 20 years, holding various roles as an early Cloudera employee. He also was the co-founder and CTO of Rocana, which was acquired by Splunk in 2017. During his time at Splunk, Eric served as the VP and Senior Distinguished Engineer responsible for cloud platform services. In this episode, Sam and Eric discuss the gap between operating infrastructure and the analytical world, stream pr...

Sep 28, 202243 minSeason 4Ep. 1

Season 3 Compressed Edition with Sam and Audra

Join Open||Source||Data executive producer Audra Montenegro as she and Sam discuss his learnings and takeaways from this season and what the future of open source data looks like. ------------------- “There's such an open conversation about, ‘Yeah, open source,’ we usually think about open source software. How can we cross apply more of what we think about in software in general into data, and then what is it that's totally new about this domain? So, the answers cluster into three groups. It's e...

Jul 20, 202216 minSeason 3Ep. 13

Accelerating Computation, Machine Learning, and Data Mesh with Sophie Watson

This episode features an interview with Sophie Watson, Technical Product Marketing Manager at NVIDIA. Previously, Sophie served as a software engineer and principal data scientist at RedHat where she used machine learning to solve business problems in the hybrid cloud. Sophie has a PhD in Bayesian statistics and frequently speaks about machine learning workflows on Kubernetes, recommendation engines, and machine learning for search. In this episode, Sam and Sophie discuss Principal Component Ana...

Jul 06, 202239 minSeason 3Ep. 12

Democratization and Cognition with Margot Gerritsen, Rachel Chalmers, and Patricia Boswell

This bonus episode features conversations from season 1 of the Open||Source||Data podcast. In this episode, you’ll hear from Margot Gerritsen, Stanford Professor and Co-Founder/Director of WiDS; Rachel Chalmers, Partner at Alchemist Accelerator; and Patricia Boswell, Staff Technical Writer at Google. Sam sat down with each guest to discuss cognition and democratization in data. You can listen to the full episodes from Margot Gerritsen, Rachel Chalmers, and Patricia Boswell by clicking the links ...

Jun 29, 20226 min

Vector Search, the AI Stack and more with Bob van Luijt

This episode features an interview with Bob van Luijt, CEO and Co-Founder of SeMI Technologies and co-creator of Weaviate, an open source vector search engine. At just 15 years of age, Bob started his own software company in the Netherlands. He went on to study music at ArtEZ University of the Arts and Berklee College of Music, and completed the Harvard Business School Program of Management Excellence. Bob is also a TedX speaker, discussing the relationship between software and language. In this...

Jun 22, 202236 minSeason 3Ep. 11
For the best experience, listen in Metacast app for iOS or Android