Trino's initial use case was replacing the Apache Hive runtime. As data lakes grew into prominence, it became clear that a faster query engine alone didn't solve every problem. The Hive model itself was a huge bottleneck and lacked the data warehouse and database features that companies needed. Apache Hudi is a new table format created at Uber that aims to address many of these issues and usher in a new generation of data lake. Tune in as we speak to the Trino Hudi conne...
Nov 16, 2022•1 hr 43 min•Ep. 41
Join us for this next episode of the broadcast, where we bring back Ryan Blue, the creator of Iceberg, to discuss some of the latest happenings in the Iceberg community. We also discuss and demo a bunch of new features that have come out in the Trino Iceberg connector. We also have a new guest, Tabular Developer Advocate Sam Redai, shedding light on this incredible community as well! Since the first episodes, Iceberg has finalized the v2 spec and added a lot of new features along the way. Likewi...
Sep 12, 2022•1 hr 33 min•Ep. 40
In this episode we sit down with engineers Steve Morgan and Edward Morgan to discuss how they use Trino at Raft. Raft provides consulting services and is particularly skilled at DevSecOps. One particular challenge they face is dealing with fragmented government infrastructure. We dive in to learn how Trino enables Raft to supply government sector clients with a data fabric solution. Raft takes a special stance on using and contributing to open source solutions that run well on...
Sep 08, 2022•1 hr 21 min•Ep. 39
We'll be doing a more focused look at a specific feature that's being added to Trino: polymorphic table functions. We're excited to talk about what they do, where we are so far, where we're going, and how you can leverage them to make Trino better than ever! Show Notes: https://trino.io/episodes/38.html Show Page: https://trino.io/broadcast/ YouTube Video: https://www.youtube.com/watch?v=90e5WxhwNas
Aug 17, 2022•1 hr 8 min•Ep. 38
This episode introduces the benefits of having the Trino community around the Trino project. What is the purpose of communities in tech projects? Would the product be successful without a community or anyone to maintain it? We introduce some new faces that will be stewards in our journey to growing the adoption of our favorite query engine, what each of them does, and how their work impacts you as a community member! Most importantly, you can learn how to get involved and help us lear...
Aug 04, 2022•1 hr 38 min•Ep. 37
As Trino preps to jump to Java 17, we discuss the latest features added from Java 11 to Java 17, talk through with Martin a few of the potential uses of new features like the Vector API, language improvements, and G1GC speedups, and finally dive into some of the features that we'll be implementing in the upcoming months under a new project in Trino! - Intro song: 00:00 - Intro: 00:36 - Releases: 8:17 - Question of the episode: Will Trino be making a vectorized C++ version of Trino...
Jun 16, 2022•1 hr 56 min•Ep. 36
In our Trino Community Broadcast episode 35 we are catching up on recent releases 375, 376, 377, and 378. We then talk about how Trino is packaged as a tarball, RPM, and Docker container, what some of the differences are, and how you can customize any of them. Beyond that, we also look for your feedback and input on usage of the different packages. As a next step we chat about adopting Java 17 as the standard for Trino, and then we get a demo of a new feature of the web UI. - Intro song: 00:00 - Intro: 0...
May 24, 2022•1 hr 26 min•Ep. 35
We start with news from the Trino releases 372, 373, and 374, and an update on Project Tardigrade. Then we dive into the details of the new Delta Lake connector contributed to Trino by Starburst. - Intro song: 00:00 - Intro: 00:37 - Releases: 2:05 - Project Tardigrade update: 9:21 - Concept of the episode: A new connector for Delta Lake object storage. 18:37 - Pull requests of the episode: Add Delta Lake connector and documentation. 26:10 - Demo of the episode: Delta Lake connector in action. 29...
Mar 18, 2022•1 hr 2 min•Ep. 34
Goldman Sachs uses Trino to reduce last-mile ETL and provide a unified way of accessing data through federated joins. Making a variety of data sets from different sources available in one spot for our data science team was a tall order. Data must be quickly accessible to data consumers and systems like Trino must be reliable for users to trust this singular access point for their data. Join us on this next episode as we discuss with engineers from Goldman Sachs how they integrated Trino and a...
Feb 28, 2022•1 hr 47 min•Ep. 33
- Intro Song: 00:00 - Intro: 00:33 - News: 8:02 - Concept of the week: K8s architecture: Containers, Pods, and kubelets: 14:27 - PR of the week: PR 11 Merge contributor version of k8s charts with the community version: 55:20 - Demo: Running the Trino charts with kubectl: 57:42 Show Notes: https://trino.io/episodes/24.html Show Page: https://trino.io/broadcast/
Sep 17, 2021•1 hr 13 min•Ep. 24
- Intro Song: 00:00 - Intro: 00:34 - News: 5:18 - Concept of the week: Row pattern matching and MATCH_RECOGNIZE: 14:26 - PR of the week: PR 8348 Document row pattern recognition in window: 52:16 - Demo: Showing MATCH_RECOGNIZE functionality by example: 57:13 - Question of the week: How do you tag a list of rows with custom periodic rules?: 1:12:51 Show Notes: https://trino.io/episodes/23.html Show Page: https://trino.io/broadcast/
Aug 09, 2021•1 hr 22 min•Ep. 23
This episode will cover LinkedIn's journey to upgrade from PrestoSQL to Trino and some of the operational challenges LinkedIn's engineering team has faced at their scale. - Intro Song: 00:00 - Intro: 00:36 - News: 7:39 - Concept of the week: Trino usage at LinkedIn: 15:55 - Concept of the week: Trino hardware and operational scale: 23:23 - Concept of the week: Challenges operating at scale: 44:09 - Concept of the week: Open source at LinkedIn: 48:36 - Concept of the week: PrestoSQL to Trino upg...
Aug 03, 2021•1 hr 58 min•Ep. 22
- Intro Song: 00:00 - Intro: 00:35 - News: 7:42 - Question of the week: Can dbt connect to different databases in the same project?: 18:18 - Concept of the week: What is dbt?: 21:28 - Concept of the week: dbt + Trino: 38:09 - Demo: Querying Trino from a dbt project: 47:21 - PR of the week: PR 8283 Externalised destination table cache expiry duration for BigQuery Connector: 1:21:13 Show Notes: https://trino.io/episodes/21.html Show Page: https://trino.io/broadcast/...
Jul 15, 2021•1 hr 26 min•Ep. 21
- Intro Song: 00:00 - Intro: 00:35 - News: 10:16 - Concept of the week: Trino for the Trinewbie: 19:12 - Concept of the week: Marius' Journey: 21:03 - Concept of the week: Contributing to Trino: 54:55 - PR of the week: PR 8135 Set default time zone for the current session: 1:03:36 - Demo: Contributing to Trino: 1:11:49 - Question of the week: How do I search nested objects in Elasticsearch from Trino?: 1:24:24 We didn't have time to run through the demo. I created another video outside of the ...
Jun 29, 2021•1 hr 30 min•Ep. 20
- Intro Song: 00:00 - Intro: 00:37 - News: 7:56 - Concept of the week: Ingesting into Iceberg with Pulsar and Flink at BlueCat: 17:30 - Concept of the week: BlueCat Overview: 20:31 - Concept of the week: Single Tenant to Multi-Tenant: 21:33 - Concept of the week: Pre-Iceberg: 26:13 - Concept of the week: Iceberg: 39:29 - PR of the week: PR 1905 Add format_number function: 1:01:55 - Demo: Showing the format_number functionality: 1:04:38 - Question of the week: How do I search nested objects in ...
Jun 11, 2021•1 hr 18 min•Ep. 19
- Intro Song: 00:00 - Intro: 00:34 - News: 1:44 - Concept of the week: Trino Views, Hive Views, and Materialized Views: 4:57 - PR of the week: PR 4832 Add Iceberg support for materialized views: 59:04 - Demo: Showing the different views in Trino: 1:01:25 - Question of the week: Are JDBC drivers backwards compatible with older Trino versions?: 1:21:02 Show Notes: https://trino.io/episodes/18.html Show Page: https://trino.io/broadcast/
Jun 02, 2021•1 hr 27 min•Ep. 18
- Intro Song: 00:00 - Intro: 00:34 - News: 2:52 - Concept of the week: Resurface and the Resurface connector: 8:58 - PR of the week: PR 4022 Add Soundex function: 1:08:17 - Demo: Using the soundex function: 1:10:27 - Question of the week: How to export query results into a file (e.g. CTAS, but into a single file)?: 1:18:46 Show Notes: https://trino.io/episodes/17.html Show Page: https://trino.io/broadcast/
May 17, 2021•1 hr 25 min•Ep. 17
- Intro Song: 00:00 - Intro: 00:34 - News: 7:28 - Concept of the week: Apache Druid and realtime analytics: 14:51 - PR of the week: PR 3522 Add Druid connector: 33:35 - Demo: Using the Druid Web UI to create an ingestion spec querying via Trino: 1:01:29 - Question of the week: Why doesn’t the Druid connector use the native json over http calls?: 1:10:20 Show Notes: https://trino.io/episodes/16.html Show Page: https://trino.io/broadcast/
May 03, 2021•1 hr 23 min•Ep. 16
- Intro Song: 00:00 - Intro: 00:34 - News: 6:37 - Concept of the week: Apache Iceberg and the Iceberg spec: 13:32 - PR of the week: PR 7233 Fix queries on tables without snapshot id: 1:07:44 - Demo: Creating tables with Iceberg and reading the data in Trino: 1:10:39 - Question of the week: What do I do to restart the test pipeline if it fails on me?: 1:23:35 Show Notes: https://trino.io/episodes/15.html Show Page: https://trino.io/broadcast/...
Apr 21, 2021•1 hr 32 min•Ep. 15
- Intro Song: 00:00 - Intro: 00:34 - News: 7:57 - Concept of the week: Apache Iceberg and the table format: 17:14 - PR of the week: PR 1067 Add Iceberg connector: 59:22 - Demo: Creating tables through Iceberg and reading them through Trino: 1:04:04 - Question of the week: Why do we still depend on the Hive metastore if metadata for Iceberg saves to the filesystem?: 1:18:39 Show Notes: https://trino.io/episodes/14.html Show Page: https://trino.io/broadcast/...
Apr 07, 2021•1 hr 29 min•Ep. 14
- Intro Song: 00:00 - Intro: 00:34 - News: 4:23 - Concept of the week: Data cubes and Apache Pinot: 14:00 - Interview: Apache Pinot: 24:40 - PR of the week: PR 2028 Add Pinot connector: 53:51 - Question of the week: Why does my passthrough query not work in the Pinot connector?: 1:06:23 - Demo: Pinot batch insertion and query using Trino Pinot Connector: 1:12:18 - Get involved with Pinot: 1:22:00 Show Notes: https://trino.io/episodes/13.html Show Page: https://trino.io/broadcast/...
Mar 22, 2021•1 hr 26 min•Ep. 13
- Intro Song: 00:00 - Intro: 00:34 - News: 2:57 - Concept of the week: Trino client, Python, and Apache Superset: 5:37 - Interview: Apache Superset: 19:22 - PR of the week: Superset PR 13105 feat: first step native support Trino: 46:37 - PR Demo: Superset PR 13105 feat: first step native support Trino: 55:13 - Get involved with Superset: 1:25:06 - Question of the week: How do I use the Trino REST API?: 1:29:02 Show Notes: https://trino.io/episodes/12.html Show Page: https://...
Mar 08, 2021•1 hr 34 min•Ep. 12
- Intro Song: 00:00 - Intro: 00:34 - News: 3:22 - Concept of the week: Dynamic Filtering and Recap: 7:43 - PR of the week: PR 1072 Implement Dynamic Partition Pruning: 17:56 - PR Demo: PR 1072 Implement Dynamic Partition Pruning: 42:03 Show Notes: https://trino.io/episodes/11.html Show Page: https://trino.io/broadcast/
Feb 24, 2021•1 hr 16 min•Ep. 11
Table of Contents: - Intro Song: 00:00 - Intro: 00:32 - Where did the bunny come from?: 8:58 - Bunny names in the running...: 16:58 - And our bunny's name is..: 20:15 - Release 352 sneak peek: 23:46 - Community stats update: 34:54 Show Notes: https://trino.io/episodes/10.html Show Page: https://trino.io/broadcast/
Feb 18, 2021•42 min•Ep. 10
Table of Contents: - Intro Song: 00:00 - Intro: 00:32 - Question of the week: How do I migrate to Trino from PrestoSQL? 11:55 - Concept of the week: Distributed hash-join 16:03 - Quick Discussion: Contributing Documents and Testimonials 58:05 Show Notes: https://trino.io/episodes/9.html Show Page: https://trino.io/broadcast/
Feb 11, 2021•1 hr 3 min•Ep. 9
Table of Contents: - Intro Song: 00:00 - Intro: 00:32 - Martin Traverso Intro: 1:45 - Dain Sundstrom Intro: 4:30 - David Phillips Intro: 8:26 - Eric Hwang Intro: 12:24 - Past: Presto name, Facebook projects, and open source: 16:50 - Past: Where did the name Trino come from?: 23:16 - Past: Why Martin, Dain, and David left Facebook and created a Presto fork: 26:14 - Past: Why did you choose to continue using Presto instead of rebranding?: 34:51 - Past: How did the contending Presto drive us to reb...
Jan 11, 2021•1 hr 10 min•Ep. 8
Table of Contents: - Intro Song: 00:00 - Intro: 00:20 - News: 3:27 - Concept of the week: Cost Based Optimizer 16:48 - PR of the week: PR 1415 Decorrelate subqueries with Limit or TopN 43:09 - PR Demo: EXPLAIN Decorrelate subqueries with Limit or TopN 53:36 - Question of the week: Will running Presto on my relational database make processing faster? 1:02:24 Show Notes: https://trino.io/episodes/7.html Show Page: https://trino.io/broadcast/...
Dec 21, 2020•1 hr 12 min•Ep. 7
Table of Contents: - Intro Song: 00:00 - Intro: 00:20 - News: 12:32 - Concept of the week: Query Planning 20:31 - PR of the week: PR 730 Remove duplicate predicates 29:55 - PR Demo: Remove duplicate predicates demo 33:47 - Question of the week: How should I allocate memory properties? 54:49 Show Notes: https://trino.io/episodes/6.html Show Page: https://trino.io/broadcast/
Dec 07, 2020•1 hr 11 min•Ep. 6
Table of Contents: - Intro Song: 00:00 - Intro: 00:20 - News: 05:26 - Concept of the week: Hive Partitioning 19:12 - PR of the week: PR 223 Add system.sync_partition_metadata procedure to sync Hive table partitions 29:20 - PR Demo: system.sync_partition_metadata procedure demo 34:57 - Question of the week: Why am I getting, 'Query exceeded maximum columns.' error? 1:00:56 Show Notes: https://trino.io/episodes/5.html Show Page: https://trino.io/broadcast/...
Nov 23, 2020•1 hr 8 min•Ep. 5
Table of Contents: - Intro: 0:20 - News: 3:56 - Concept of the week: Presto on ACID 19:35 - PR of the week: PR 5402 Hive ACID row-level INSERT and DELETE 27:24 - PR Demo: Hive ACID row-level INSERT and DELETE demo 42:26 - Question of the week: Why is JDK 11 required to run Presto and how can I revert to JDK8? 54:19 Show Notes: https://trino.io/episodes/4.html Show Page: https://trino.io/broadcast/
Nov 10, 2020•1 hr 1 min•Ep. 4