Open||Source||Data - podcast cover

Open||Source||Data

Charna Parkeywww.datastax.com
What can we learn from ai-native development through stimulating conversations with developers, regulators, academics and people like you that drive forward development, seek to understand impact, and are working to mitigate risk in this new world? Join Charna Parkey and the community shaping the future of open source data, open source software, data in AI, and much more.
Last refreshed:
Follow this podcast in the Metacast mobile app to refresh it and see new episodes.
Download Metacast podcast app
Podcasts are better in Metacast mobile app
Don't just listen to podcasts. Learn from them with transcripts, summaries, and chapters for every episode. Skim, search, and bookmark insights. Learn more

Episodes

Open Source Innovation, The GPL for Data, and The Data In to Data Out Ratio with Larry Augustin

This episode features an interview with Larry Augustin, angel investor and advisor to early-stage technology companies. Larry previously served as the Vice President for Applications at AWS, where he was responsible for application services like Pinpoint, Chime, and WorkSpaces. Before joining AWS, Larry was the CEO of SugarCRM, an open source CRM vendor. He also was the founder and CEO of VA Linux, where he launched SourceForge. Among the group who coined the term “open source”, Larry has sat on...

Jun 08, 202240 minSeason 3Ep. 10

Data Observability with Barr Moses, Einat Orr, and Shinji Kim

This bonus episode features conversations from season 2 of the Open||Source||Data podcast. In this episode, you’ll hear from Barr Moses, Co-founder and CEO at Monte Carlo; Einat Orr, Co-founder and CEO at Treeverse; and Shinji Kim, Founder and CEO at Select Star. Sam sat down with each guest to discuss data observability. You can listen to the full episodes from Barr Moses, Einat Orr, and Shinji Kim by clicking the links below. ------------------- Episode Timestamps: (00:35): Barr Moses (01:21):...

Jun 01, 20224 min

Apache Pinot and Real-Time Analytics with Neha Pawar

This episode features an interview with Neha Pawar, a Founding Engineer at StarTree. StarTree is a software development company that focuses on democratizing data for all users by providing real-time, user-facing analytics. Prior to her time at StarTree, Neha was a Senior Software Engineer on LinkedIn’s Data Analytics team where she spent five years working on Apache Pinot. Neha has provided countless contributions to Pinot over the years, focusing on real-time streaming integrations, ingestion,...

May 25, 202241 minSeason 3Ep. 9

Real-Time Data, Enabling Developers, and User Experience with DeVaris Brown

This episode features an interview with DeVaris Brown, CEO and Co-Founder of Meroxa. Meroxa was founded in 2020 and enables teams of any size and any expertise to build real-time data pipelines in minutes. Previously, DeVaris was a product leader at Twitter, Heroku, and Zendesk. Sam and DeVaris even crossed paths at Microsoft in the aughts. In this episode, Sam and DeVaris discuss enabling developers, real-time data, and providing the ultimate user experience. ------------------- "From the begin...

May 11, 202240 minSeason 3Ep. 8

Data Meshes, Fabrics, and Discovery with Zhamak Dehghani, David Thomas, and Shirshanka Das

This bonus episode features conversations from season 1 and 2 of the Open||Source||Data podcast. In this episode, you’ll hear from Zhamak Dehghani, Director of Emerging Technologies at ThoughtWorks North America; David Thomas, Principal at Deloitte; and Shirshanka Das, Founder of LinkedIn DataHub and Acryl Data. Sam sat down with each guest to discuss data meshes, fabrics, and discovery. You can listen to the full episodes from Zhamak Dehghani, David Thomas, and Shirshanka Das by clicking the li...

May 04, 20224 min

Investing in Communities, Differentiating, and Trusting Your Gut with Erica Brescia

This episode features an interview with Erica Brescia, Managing Director of Redpoint Ventures. At Redpoint, Erica focuses her investing on infrastructure, DevOps, and security. Erica has over 15 years of experience in the open source community and currently serves on the board of directors of the Linux Foundation. Prior to joining Redpoint, Erica was also an angel investor and advisor to companies such as Netlify, Coda, and Xata. In this episode, Sam and Erica discuss the evolution of open sourc...

Apr 27, 202235 minSeason 3Ep. 7

Data on Kubernetes with Kelsey Hightower, Lachlan Evenson, and Patrick McFadin

This bonus episode features conversations from season 1 of the Open||Source||Data podcast. In this episode, you’ll hear from Kelsey Hightower, Principal Engineer at Google Cloud; Lachlan Evenson, Principal Program Manager at Microsoft Azure; and Patrick McFadin, Head of Developer Relations at DataStax. Sam sat down with each guest to discuss Data on Kubernetes and how they’re making progress on a stateless infrastructure. You can listen to the full episodes from Kelsey Hightower, Lachlan Evenson...

Apr 20, 20224 min

Deep Fakes, Responsible Data Science, and Trust with David Danks

This episode features an interview with David Danks, Professor of Data Science and Philosophy and affiliate faculty in Computer Science and Engineering at University of California, San Diego. Prior to UCSD, David was the L.L. Thurstone Professor of Philosophy and Psychology at Carnegie Mellon University. David’s research interests are at the intersection of philosophy, cognitive science, and machine learning. He has also examined the ethics surrounding artificial intelligence in the fields of he...

Apr 13, 202245 minSeason 3Ep. 6

Cloud Innovation, Analytics, and Data Transformation with Monica Kumar

This episode features an interview with Monica Kumar, Senior Vice President of Marketing and Cloud-Go-To Market at Nutanix. Nutanix is a data platform that is redefining workloads in cloud environments. Prior to Nutanix, Monica spent two decades at Oracle where she launched several market solutions. Monica is passionate about positioning and supporting women in leadership roles. She is a founding limited partner of Neythri Futures Fund, a venture fund dedicated to bringing South Asian women into...

Mar 30, 202237 minSeason 3Ep. 5

Data Lakehouses, Interoperability, and Accessibility with Tomer Shiran

This episode features an interview with Tomer Shiran, Founder and Chief Product Officer at Dremio. Dremio is a high-performance SQL lakehouse platform that helps companies get more from their data in the fastest way possible. Prior to Dremio, Tomer served as VP of Product at MapR and also held product management and engineering roles at Microsoft and IBM Research. He also has a master’s degree from Carnegie Mellon University as well as a bachelor’s from Technion - Israel Institute of Technology....

Mar 16, 202230 minSeason 3Ep. 4

Interoperability, Governance, and Divergent Teams with Prukalpa Sankar

This episode features an interview with Prukalpa Sankar, Co-Founder of Atlan. Atlan is a venture-backed startup building a modern data workspace. Prukalpa also co-founded SocialCops, a data for good company behind landmark projects such as India’s National Data Platform. Prukalpa is a recognized industry leader, landing on the Forbes 30 Under 30 list and Fortune’s 40 Under 40. In this episode, Prukalpa and Sam discuss how diversity is a data team’s biggest strength, why governance isn’t always a...

Mar 02, 202231 minSeason 3Ep. 3

Trust, Automation, and Trade-Offs with Joseph Jacks

This episode features an interview with Joseph Jacks, Founder and General Partner of OSS Capital. OSS Capital is the first and only COSS (Commercial Open Source Software) company investor that focuses on supporting early-stage COSS founders. Joseph, also known as JJ, has worked at Mesosphere, TIBCO Software, and Talend in various sales, engineering, and strategy roles. In this episode, JJ and Sam weigh the trade-offs of open and closed core companies and discuss how each can go public. JJ also d...

Feb 16, 202237 minSeason 3Ep. 2

Open Source, Adoptability, and Name Changes with Martin Traverso

This episode features an interview with Martin Traverso, CTO at Starburst Data and Co-founder of Trino, a lightning fast distributed SQL query engine. Martin was previously a software engineer at Facebook where he led the Presto (now Trino) development team. Trino has gained worldwide adoption from companies like Netflix, Amazon, and LinkedIn. In this episode, Martin sits down with Sam to discuss the barriers, advantages, and complications of going open-source. Episode Notes -Guest Quote [33:55]...

Feb 02, 202247 minSeason 3Ep. 1

Embeddings, Feature stores, and MLOps with Simba Khadder

Join CEO of Featureform, Simba Khadder as he talks with Sam about how versioning, immutability, and sharing will accelerate ML workflows. Tune-in on state of the art collaboration in data teams, and the power of focusing on your north star. See omnystudio.com/listener for privacy information.

Oct 14, 202131 minSeason 2Ep. 10

Abundance, Metadata, and Automation with Mark Grover

How can we make data 10X more accessible for data-driven people within data-driven companies? Tune in to Mark and Sam discussing probabilistic product management, and the emerging metadata ecosystem. See omnystudio.com/listener for privacy information.

Sep 30, 202129 minSeason 2Ep. 9

Metadata, Communities, and Architecture with Shirshanka Das

How can we evolve an expanding ecosystem of data technologies while making sense of the whole? Tune in to LinkedIn DataHub, and Acryl Data founder, Shirshanka Das, as he and Sam have a discussion on metadata at the center and specialization at the edge to sustainably scale data governance. See omnystudio.com/listener for privacy information.

Sep 16, 202136 minSeason 2Ep. 8

Data Management Pain Points and Future Solutions for Data Discovery

Data discovery is one of the hardest problems to solve in data management in general and comes up as a major pain point in most data mesh discussions. Tune in to this all-star expert panel recorded in collaboration with the Data Mesh community, and hosted by a previous Open||Source||Data podcast guest, Paco Nathan of Derwen.ai . Paco engages panelists, Shinji Kim (Select Star), Sophie Watson (Red Hat), Mark Grover (Stemma), and Shirshanka Das (Acryl Data) in a 60-minute discussion on not only Da...

Sep 02, 202159 minSeason 2Ep. 7

ModelOps, ML Monitoring, and Busy Humans with Elena Samuylova

It’s 2 AM - do you know what your models are doing? Listen to Elena Samuylova as she talks to us about how to bridge the critical gaps between data scientists, engineers, and business managers using tooling and empathy. See omnystudio.com/listener for privacy information.

Aug 19, 202127 minSeason 2Ep. 6

Cloud-Native, Open-Source, and Collaborative with Eric Brewer and Melody Meckfessel

Google Fellow & VP of Infrastructure Eric Brewer, Observable CEO Melody Meckfessel, and DataStax Chief Strategy Officer Sam Ramji explore the state of the art, the near future, and grand challenges for the next decade in cloud-native data. See omnystudio.com/listener for privacy information.

Aug 05, 202136 minSeason 2Ep. 5

MLOps, AIOps, and Data Startups with Jocelyn Goldfein

Dealing with data hyperabundance, solving economic problems for businesses and changing lives for the better. Tune-in to Managing Director at Zetta Venture Partners, Jocelyn Goldfein as she and Sam have a discussion around engineering leadership, organizational graph structures, and productization of AI. See omnystudio.com/listener for privacy information.

Jul 22, 202132 minSeason 2Ep. 4

Git-Like Branch and Merge for Data with Einat Orr

What if you could version object storage just like code? Tune in to Einat Orr as she explains how CI/CD and data lineage are being transformed through versioning data, enabling sandboxes, safe rollbacks, and coherent history. See omnystudio.com/listener for privacy information.

Jul 08, 202128 minSeason 2Ep. 3

Data Discoverability, Products, and User Diversity with Shinji Kim

Learn how an accelerating abundance of data can be harnessed through telemetry. Tune-in while Shinji Kim and Sam explore opening data to more users, PageRank for tables, and pragmatic use of data lineage to find value. See omnystudio.com/listener for privacy information.

Jun 24, 202127 minSeason 2Ep. 2

Data Observability, Customer-Led Growth, and Confidence with Barr Moses

Barr Moses discusses with Sam about bringing DevOps into Data Engineering, building a data startup, and letting joy guide your way to creating impact. Learn how being data-driven depends on systems of people and trust. See omnystudio.com/listener for privacy information.

Jun 10, 202127 minSeason 2Ep. 1

Open Source Data & Its Role in the Future of Technology: Season 1 Recap

Wrapping up Season 1, Open||Source||Data producer Audra Montenegro Carter joins Sam Ramji in a conversation about the inspiration and behind-the-scenes production of the podcast, touching upon the top takeaways and lessons learned with Season 1 guests from AWS, Microsoft, ThoughtWorks, Deloitte, Observable, and many more. See omnystudio.com/listener for privacy information.

Apr 22, 202119 minSeason 1Ep. 16

Data Visualization, Democratization, and Javascript with Melody Meckfessel

Observable Co-Founder and CEO Melody Meckfessel joins Sam in a conversation on how millions of developers are changing how we experience data. Listen-in as Melody explains the importance of data literacy and the shift in data collaboration. See omnystudio.com/listener for privacy information.

Mar 25, 202130 minSeason 1Ep. 15

DataOps, MLOps, and Self Service: How Data Teams are Changing

Join Data Institute's Managing Director, Jesse Anderson to learn how data teams are changing in response to overwhelming demand for data products. Tune in as he and Sam discuss bringing software engineering into the domain of data - and why he wrote Data Teams. See omnystudio.com/listener for privacy information.

Mar 11, 202133 minSeason 1Ep. 14

Fabrics, Meshes, and Graphs with Deloitte Principal Dave Thomas

Join Dave and Sam as they discuss data sets evolving from finite to infinite, and finding the needle in the haystack with math. Listen to Dave talk about cutting edge data problems and the essential need for curious people. See omnystudio.com/listener for privacy information.

Feb 25, 202127 minSeason 1Ep. 13

Metadata, Graphs, and Responsible AI with Paco Nathan

Data Science player and coach, Author, and Venture Amplifier Paco Nathan talks with Sam Ramji about Hybrid AI, mathematical reversibility, and using AI to solve knowledge problems that the exponential growth of data will create for years to come. Join these two as they discuss how you can bring multiple data disciplines together using empathy and math. See omnystudio.com/listener for privacy information.

Feb 11, 202132 minSeason 1Ep. 12

Data Analytics: Hard Skills vs Soft Skills and the Gift of Thinking Different

Analytics manager, and Women in Data podcast producer and host Karen Jean-Francois walks us through the differences between Data Science and Analytics. Join her and Sam as they discuss valuable skills you’ll need when transitioning to a career in Data Analytics. Hear Karen’s perspective on the benefits of thinking differently and having a mentor to guide you through transitions. See omnystudio.com/listener for privacy information.

Jan 28, 202127 minSeason 1Ep. 11
For the best experience, listen in Metacast app for iOS or Android