Anaconda is a popular platform for data science, machine learning, and AI. It provides trusted repositories of Python and R packages and has over 35 million users worldwide. Rob Futrick is the CTO at Anaconda , and he joins the show to talk about the platform, the concept of an OS for AI, and more. This episode is hosted by Lee Atchison. Lee Atchison is a software architect, author, and thought leader on cloud computing and application modernization. His best-selling book, Architecting for Scale...
May 02, 2024•44 min
Vercel provides a cloud platform to rapidly deploy web projects, and they develop the highly successful Next.js framework. The company recently made headlines when they announced v0 which is a generative AI tool to create React code from text prompts. The generated code uses open-source tools like Tailwind CSS and shadcn/ui. Lee Robinson is the VP of Product at Vercel . He helps lead the product teams and focuses on developer experience on the platform. He joins the show to talk about Vercel, th...
Jan 30, 2024•49 min
Algolia is a platform that provides search as a service. The company was founded in 2012, was part of Y Combinator’s Winter 2014 class, and has become highly popular for integrating modern search functionality into web-facing services. Sean Mullaney is the CTO of Algolia and has worked at Google X, Stripe, and Zolando. He joins the show today to talk about Algolia, neural search, vector compression, search optimization, and more. Sean’s been an academic, startup founder, and Googler. He has publ...
Jan 23, 2024•49 min
Jodie Burchell is the Data Science Developer Advocate at JetBrains, which makes integrated development environments or, IDEs, for many major languages. After observing the rapid growth of the AI coding assistant landscape, the company recently announced integration of an AI assistant into their IDEs. Jodie joins the show today to talk about why the company decided to take this step, the design challenges of adding AI tools to software products, and the team’s particular interest in auto-generati...
Jan 16, 2024•55 min
This episode of Software Engineering Daily is part of our on-site coverage of AWS re:Invent 2023, which took place from November 27th through December 1st in Las Vegas. In today’s interview, host Jordi Mon Companys speaks with Ankur Mehrotra who is the Director and GM of Amazon SageMaker. Jordi Mon Companys is a product manager and marketer that specializes in software delivery, developer experience, cloud native and open source. He has developed his career at companies like GitLab, Weaveworks, ...
Jan 09, 2024•35 min
An embedding is a concept in machine learning that refers to a particular representation of text, images, audio, or other information. Embeddings are designed to make data consumable by ML models. However, storing embeddings presents a challenge to traditional databases. Vector databases are designed to solve this problem. Pinecone has developed one of the most prominent vector databases that is widely used for ML and AI applications. Marek Galovic is a software engineer at Pinecone and works on...
Jan 02, 2024•39 min
Vespa is a fully featured search engine and vector database, and it has integrated ML model inference. The project open sourced in 2017, and since then has grown to become a prominent platform for applying AI to big data sets at serving time. Vespa began as a project to solve Yahoo’s use cases in search, recommendation, and ad serving. The company made headlines in October when they announced they’re spinning Vespa.ai out of Yahoo as a separate company. Jon Bratseth is the CEO at Vespa and he jo...
Dec 26, 2023•46 min
GitHub Copilot is an AI tool developed by GitHub and OpenAI to assist software developers by autocompleting code. Copilot kicked off a revolution in software engineering, and AI assistants are now considered essential tools to many developers. Joseph Katsioloudes is a cyber security specialist and works at the GitHub Security Lab . He joins the show today to talk about Copilot, the future of software development in an AI world, using AI to improve security, and more. Check out Joseph’s bio and t...
Dec 19, 2023•50 min
Machine learning model research requires running expensive, long-running experiments where even a slight mis-calibration can cost millions of dollars in underutilized compute resources. Once trained, model deployment, production monitoring, and observability requirements all present unique operational challenges. Chris Van Pelt is the Chief Information Officer of Weights and Biases , which is the industry standard in experiment monitoring and visualization, and has expanded that expertise into a...
Dec 12, 2023•38 min
Hugging Face was founded in 2016 and has grown to become one of the most prominent ML platforms. It’s commonly used to develop and disseminate state-of-the-art ML models and is a central hub for researchers and developers. Sayak Paul is a Machine Learning Engineer at Hugging Face and a Google Developer Expert. He joins the show today to talk about how he entered the ML field, diffusion model training, the transformer-based architecture, and more. Sean’s been an academic, startup founder, and Goo...
Dec 05, 2023•47 min
Cloud-based software development platforms such as GitHub Codespaces continue to grow in popularity. These platforms are attractive to enterprise organizations because they can be managed centrally with security controls. However, many, if not most, developers prefer a local IDE. Daytona is aiming to bridge that gap. It’s a layer between a local IDE and a backend server, so developers can work locally while interfacing invisibly with a remote environment. Ivan Burazin is the CEO and Co-Founder a...
Nov 23, 2023•48 min
Knowledge graphs are an intuitive way to define relationships between objects, events, situations, and concepts. Their ability to encode this information makes them an attractive database paradigm. Hume is a graph-based analysis solution developed by GraphAware . It represents data as a network of interconnected entities and provides analysis capabilities to extract insights from the data. Luanne Misquitta is VP of Engineering at GraphAware and she joins the show today to talk about graph databa...
Nov 22, 2023•58 min
Speech technology has been around for a long time, but in the last 12 months it’s undergone a quantum leap. New speech synthesis models are able to produce speech that’s often indistinguishable from real speech. I’m sure many listeners have heard deep fakes where computer speech perfectly mimics the voice of famous actors or public figures. A major factor in driving the ongoing advances is generative AI. Speechlab is at the forefront of using new AI techniques for realtime dubbing, which is the ...
Nov 02, 2023•45 min
If you’re a developer, there’s a good chance you’ve experimented with coding assistants like GitHub Copilot. Many developers have even fully integrated these tools into their workflows. One way these tools accelerate development is by autocompleting entire blocks of code. The AI achieves this by having awareness of the surrounding code. It understands context. However, in many cases the context available to an AI is limited. This restricts the AI’s ability to suggest more sweeping changes to a c...
Nov 01, 2023•42 min
When StackOverflow launched in 2008 it lowered the barrier to writing complex software. It solved the longstanding problem of accessing accurate and reliable programming knowledge by offering a collaborative space where programmers could ask questions, share insights, and receive high-quality answers from a community of experts. Generative AI has impacted the way programmers want to consume this knowledge. It has also opened new possibilities in terms of getting a personalized and real-time resp...
Oct 03, 2023•48 min
AI-assisted software delivery refers to the utilization of artificial intelligence to assist, enhance, or automate various phases of the software development lifecycle. AI can be used in numerous aspects of software development, from requirements gathering to code generation to testing and monitoring. The overarching aim is to streamline software delivery, reduce errors and, ideally, reduce the time and costs associated with software development. Birgitta Böckeler is the Global Lead for AI-assis...
Aug 15, 2023•1 hr 2 min
Generative pre-trained transformer models, or GPT models, have countless applications and are being rapidly deployed across a wide range of domains. However, using GPT models without appropriate safeguards can lead to leakage of sensitive data. This concern underscores the critical need for privacy and data protection. Skyflow LLM Privacy Vault prevents sensitive data from reaching GPTs. Amruta Moktali is the Chief Product Officer at Skyflow and she joins us today. We discuss generative AI, how ...
Jul 27, 2023•52 min
There are many types of early stage funding available from friends and family to seed to series A. Some firms invest across a wide set of technologies and seek only to provide capital. Others are in it for the long haul – they focus on specific areas of technology and develop both long term relationships and deep expertise over time. Today, we are interviewing Matt Turck of First Mark Capital , who is in it for the long haul and whose portfolio companies include Dataiku, Crossbeam, Ada, Cockroac...
Mar 10, 2023•51 min
ChatGPT is an artificial intelligence language model developed by OpenAI. It is part of the GPT (Generative Pre-trained Transformer) family of models, which are designed to generate human-like text based on input prompts. ChatGPT is specifically trained to carry out conversational tasks, such as answering questions, completing sentences, and engaging in dialogue. It has been pre-trained on a large corpus of text data and fine-tuned on specific tasks to improve its performance. As a result, ChatG...
Feb 24, 2023•1 hr 13 min
Today, we spoke with Daniel Situnayake of Edge Impulse . We discussed AI, machine learning, edge devices, TinyML and AI tool chain. Sponsorship inquiries: sponsor@softwareengineeringdaily.com The post Edge Impulse with Daniel Situnayake appeared first on Software Engineering Daily ....
Oct 01, 2022•49 min
The default configuration in most databases is meant for broad compatibility rather than performance. Database tuning is a process in which the configurations of a database are modified to achieve optimal performance. Databases have hundreds of configuration knobs that control various factors, such as the amount of memory to use for caches or how often the data is written to the storage. The problem with these knobs is that they are not standardized (i.e., two databases may have a different name...
Sep 23, 2022•40 min
Originally published on January 1, 2022. Charlie Gerard is an incredibly productive developer. In addition to being the author of Practical Machine Learning in JavaScript, her website charliegerard.dev has a long list of really interesting side projects exploring the intersection of human computer interaction, computer vision, interactivity, and art. In this episode we touch on some of these projects and broadly explore how practical it is to bring interesting HCI concepts into one’s work. Spons...
Sep 18, 2022•31 min
At Lyft, Ketan Umare worked on Flyte, an orchestration system for machine learning. Flyte provides reliability and APIs for machine learning workflows, and is used at companies outside of Lyft such as Spotify. Since leaving Lyft, Ketan founded Union.ai, a company focused on productionizing Flyte as a service. He joins the show to talk about the architecture and usage of Flyte, as well as how he is formulating the company around it. Sponsorship inquiries: sponsor@softwareengineeringdaily.com The ...
Mar 23, 2022•55 min
Historically, search engines made money by showing sponsored ads alongside organic results. As the idiom goes, if you’re not paying for something, you are the product. Neeva is a new take on search engines. When you search at neeva.com, you get the type of result you’d expect from a search engine minus any advertising. In this episode, I speak with Darin Fisher, Software Engineer at Neeva. We discuss the motivation, implementation, and mobile experience for searching with Neeva. Sponsorship inqu...
Jan 25, 2022•49 min
Charlie Gerard is an incredibly productive developer. In addition to being the author of Practical Machine Learning in JavaScript, her website charliegerard.dev has a long list of really interesting side projects exploring the intersection of human computer interaction, computer vision, interactivity, and art. In this episode we touch on some of these projects and broadly explore how practical it is to bring interesting HCI concepts into one’s work. Sponsorship inquiries: sponsor@softwareenginee...
Jan 04, 2022•37 min
Once a machine learning model is trained and validated, it often feels like a major milestone has been achieved. In reality, it’s more like the first lap in a relay race. Deploying ML to production bears many similarities to a typical software release process, but brings several novel challenges like failing to generalize as expected or model drift. AI Quality management is the biggest challenge in AI today. In this episode, I interview Anupam Datta, the co-founder at TruEra. TruEra has a soluti...
Nov 30, 2021•54 min
Machine learning models must first be trained. That training results in a model which must be serialized or packaged up in some way as a deployment artifact. A popular deployment path is using Tensorflow.js to take advantage of the portability of JavaScript, allowing your model to be run on a web server or client. Gant Laborde is Chief Innovation Officer at Infinite Red, a React Native consulting team and the author of Learning TensorFlow.js: Powerful Machine Learning in JavaScript from O’Reilly...
Nov 09, 2021•51 min
Imagine a world where you own some sort of building whether that’s a grocery store, a restaurant, a factory… and you want to know how many people reside in each section of the store, or maybe how long did the average person wait to be seated or how long did it take the average factory worker to complete their assembly task. Currently today these systems are either not using AI and instead use a mix of sensors and buttons to track certain actions or they do use AI but in a way that’s highly speci...
Oct 22, 2021•43 min
The dream of machines with artificial general intelligence is entirely plausible in the future, yet well beyond the reach of today’s cutting edge technology. However, a virtual agent need not win in Alan Turing’s Imitation Game to be useful. Modern technology can deliver on some of the promises of narrow intelligence for accomplishing specific tasks. PeopleReign has created a virtual agent for IT and HR employee service. This agent’s goal is not to replace a human agent but to augment them by ha...
Sep 22, 2021•46 min
Interest in autonomous vehicles dates back to the 1920s. It wasn’t until the 1980s that the first truly autonomous vehicle prototypes began to appear. The first DARPA Grand Challenge took place in 2004 offering competitors $1 million dollars to complete a 150-mile course through the Mojave desert. The prize was not claimed. Since then, rapid progress has begun in autonomous driving fueled by advances in sensor technology, software, and the hardware which runs it. Infrastructure has become a seri...
Sep 20, 2021•37 min