Hello everyone, in this episode I talk about GraphRAG. This new RAG technique is very useful to better retrieve global information from your document or set of documents. It's also a subject that is growing a lot! GraphRAG paper: https://arxiv.org/pdf/2404.16130 LightRAG paper: https://arxiv.org/pdf/2410.05779 Instagram of the podcast: https://www.instagram.com/podcast.lifewithai Linkedin of the podcast: https://www.linkedin.com/company/life-with-ai...
Dec 05, 2024•14 min
Hello guys, in this episode of the podcast I talk about on-device AI and the SmolLM blog post published by Hugging Face. I first give my thoughts on the differences between on-device and on-cluster AI with the different needs of applications we may have. Then I go through the SmolLM blog post explaining some of the details. SmolLM blog post: https://huggingface.co/blog/smollm MobileLLM paper: https://arxiv.org/pdf/2402.14905 Instagram of the podcast: https://www.instagram.com/podcast.lifewithai ...
Nov 07, 2024•16 min
Hey guys, in the Brazilian version of the podcast I interviewed Felipe, CEO and Founder of Clarice AI, the Brazilian Grammarly. In the episode we discussed on how they created both their corrector and their style improvement algorithms. Clarice AI website: https://clarice.ai/ Instagram of the podcast: https://www.instagram.com/podcast.lifewithai Linkedin of the podcast: https://www.linkedin.com/company/life-with-ai
Oct 31, 2024•10 min
Hey guys, in the brazilian version of the podcast I interviewed Hugo Abonizio, he is LLM engineer at Maritaca AI, the only brazilian LLM company that just released Sabia 3. Their model is specialized in Portuguese and is between gpt4o and Claude 3.5 Sonnet, while being way smaller and cheaper. In the episode I try to give an idea of our conversation in the portuguese version of the podcast. Sabia 3 paper: https://arxiv.org/pdf/2410.12049 Instagram of the podcast: https://www.instagram.com/podcas...
Oct 24, 2024•10 min
Hey guys, in this episode I try to explain my ideas on why Chain of Thought works. Of course nothing that I say is proved, the idea is really to give my intuitions on it! Instagram of the podcast: https://www.instagram.com/podcast.lifewithai Linkedin of the podcast: https://www.linkedin.com/company/life-with-ai
Sep 26, 2024•8 min
Hey guys, in this episode I talk about the new model of OpenAI, the OpenAI o1, the model that thinks and reflects before answering. In the episode I share my opinion and my thoughts after the first contact with the model. Instagram of the podcast: https://www.instagram.com/podcast.lifewithai Linkedin of the podcast: https://www.linkedin.com/company/life-with-ai
Sep 19, 2024•8 min
Hey guys, in this episode I talk about the different types of AI. I don't go deep into the models, I focus more on explaining the different types and which kind of problems they solve. As types of AI I talk about statistics, machine learning, reinforcement learning, deep learning and generative AI. Instagram of the podcast: https://www.instagram.com/podcast.lifewithai Linkedin of the podcast: https://www.linkedin.com/company/life-with-ai
Sep 12, 2024•17 min
Hey guys, this is the last episode of the Llama3 paper. In this episode, I talk about their discussion on the contamination analysis of the benchmarks, the vision and the speech parts of the model. I hope you enjoyed the series of episodes about the paper, I personally learned a lot! Instagram of the podcast: https://www.instagram.com/podcast.lifewithai Linkedin of the podcast: https://www.linkedin.com/company/life-with-ai Llama 3 paper: https://scontent-cdg4-3.xx.fbcdn.net/v/t39.2365-6/45238777...
Aug 22, 2024•27 min
Hey guys, in this episode I talk about the Llama 3 paper pre-training and post-training! Instagram of the podcast: https://www.instagram.com/podcast.lifewithai Linkedin of the podcast: https://www.linkedin.com/company/life-with-ai
Aug 15, 2024•29 min
Hey guys, FINALLY we have Llama 3 paper, the release of the 405B model and the update of the 8B and 70B models. In this episode I give my thoughts about the paper and also an overview about it. In the next episode I will go more deeper in more details of the paper. Instagram of the podcast: https://www.instagram.com/podcast.lifewithai Linkedin of the podcast: https://www.linkedin.com/company/life-with-ai Llama 3 paper: https://scontent-cdg4-3.xx.fbcdn.net/v/t39.2365-6/452387774_1036916434819166_...
Jul 25, 2024•18 min
Hey guys, in this episode I talk about how to choose the best model for your AI application. I discuss the different tradeoffs you should take into considering before choosing which model to test and to deploy. Transformers tutorials github by Niels Rogge: https://github.com/NielsRogge/Transformers-Tutorials Instagram of the podcast: https://www.instagram.com/podcast.lifewithai Linkedin of the podcast: https://www.linkedin.com/company/life-with-ai
Jul 18, 2024•11 min
Hey guys, in this episode of the podcast I talk about Stable Diffusion, a famous open source image generation algorithm. Instagram of the podcast: https://www.instagram.com/podcast.lifewithai Linkedin of the podcast: https://www.linkedin.com/company/life-with-ai Stable diffusion paper: https://arxiv.org/pdf/2112.10752 Good Medium post about SD: https://medium.com/@steinsfu/stable-diffusion-clearly-explained-ed008044e07e#97f4 Amazing YouTube videos explaining diffusion: https://www.youtube.com/...
Jul 11, 2024•8 min
Hey guys, in this episode I talk about diffusion models, the algorithm behind all the image generation models today, like stable diffusion, Midjourney and dall-e. In the episode I explain the diffusion process along with some other technical concepts that are important, like white noise and markovian process. Instagram of the podcast: https://www.instagram.com/podcast.lifewithai Linkedin of the podcast: https://www.linkedin.com/company/life-with-ai Blog post explaining diffusion: https://www.ass...
Jul 04, 2024•14 min
Hey guys, in this episode I talk about 3 very important models that use contrastive learning, CLIP, SigLIP and JinaCLIP. They are image-text embedding models that allow us to for instance do image-text retrieval. CLIP paper: https://arxiv.org/pdf/2103.00020 SigLIP paper: https://arxiv.org/pdf/2303.15343 JinaCLIP paper: https://arxiv.org/pdf/2405.20204 Github of similarities and contrastive loss: https://github.com/filipelauar/projects/blob/main/similarities_and_contrastive_loss.ipynb Instagram o...
Jun 27, 2024•13 min
Hey guys, in this episode I talk about two very important technical concepts in Deep Learning, constrastive learning and cosine similarity. They are very useful when training embedding models or doing RAG. Very good blog post about contrastive losses: https://lilianweng.github.io/posts/2021-05-31-contrastive/ SimCLR paper: https://arxiv.org/abs/2002.05709 Instagram of the podcast: https://www.instagram.com/podcast.lifewithai Linkedin of the podcast: https://www.linkedin.com/company/life-with-ai...
Jun 20, 2024•12 min
Hey guys, in this episode I talk about the FineWeb dataset, the best pre-training open source dataset to date. In the episode I explain how they created the dataset and I also share some results. Link to the huggingface blog: https://huggingface.co/spaces/HuggingFaceFW/blogpost-fineweb-v1 Instagram of the podcast: https://www.instagram.com/podcast.lifewithai Linkedin of the podcast: https://www.linkedin.com/company/life-with-ai...
Jun 13, 2024•12 min
Fala galera, nesse episódio eu falo com o João Batista, Technical Product Manager da Stackspot AI. No episódio a gente falou bastante sobre o uso de LLMs como um copilot usando os próprios documentos da empresa para dar as respostas usando RAG. Hey guys, in the brazilian version of the podcast I discussed with Joao from Stackspot AI. In the episode I talk about how they are developing enterprise copilot assistants using RAG. In the episode we talk both about technical and product aspects, like s...
Jun 06, 2024•10 min
Hey guys, in this episode I talk about two papers, BitNet and 1.58 bit Transformer. These two papers from microsoft tell a new receipe to train 1 bit transformers, improve hugely the memory and energy consumption along with lower inference times. BitNet paper: https://arxiv.org/pdf/2310.11453 1.58 bit paper: https://arxiv.org/pdf/2402.17764 Instagram of the podcast: https://www.instagram.com/podcast.lifewithai Linkedin of the podcast: https://www.linkedin.com/company/life-with-ai...
May 30, 2024•10 min
Extra episode about Llama 3.
Apr 19, 2024•12 min
Hey guys, continuing the series of episodes about PEFT, in this episode I talk about inference optimization techniques for LLMs. I talk about layer pruning, where we prune consecutive layers of the LLM without almost not losing model performance. I also talk about Mixture of Depths, a similar technique to Mixture of Experts, where we have a router that choses which tokens will be processed in which layer of the LLM. Paper MoD: https://arxiv.org/pdf/2404.02258.pdf Paper layer pruning: https://...
Apr 18, 2024•14 min
Hey guys, this is the first episode in a series of episodes about PEFT, Parameter Efficient Fine Tuning. In this episode I talk about LoRA and QLoRA, two widely used methods that allowed us to fine tune LLMs way faster and in a single GPU without losing performance. Video sobre QLoRA: https://www.youtube.com/watch?v=6l8GZDPbFn8 LoRA paper: https://arxiv.org/pdf/2106.09685.pdf QLoRA paper: https://arxiv.org/pdf/2305.14314.pdf Instagram do podcast: https://www.instagram.com/podcast.lifewitha...
Apr 11, 2024•14 min
Hello, in this episode I talk a Retrieval Aware Fine Tuning (RAFT), a paper that proposes a new technique to use both domain specific fine-tuning and RAG to improve the retrieval capabilities of LLMs. In the episode I also talk about another paper that is called RAFT, but this time Reward rAnking Fine Tuning, which proposes a new technique to perform RLHF without the convergence problems of Reinforcement Learning. Retrieval Aware Fine Tuning: https://arxiv.org/abs/2403.10131v1 Reward rAnking Fin...
Mar 21, 2024•9 min
Hello guys, in this episode I explain how we can scale the context window of an LLM to more than 1M tokens using Ring Attention. In the episode, I also discuss if RAG is dead or not based on these advancements in the context window. Paper Lost in the Middle: https://arxiv.org/pdf/2307.03172.pdf Gemini technical report: https://storage.googleapis.com/deepmind-media/gemini/gemini_v1_5_report.pdf Paper Ring Attention: https://arxiv.org/pdf/2310.01889.pdf Instagram of the podcast: https://www.instag...
Mar 07, 2024•13 min
Hey guys, in the Brazilian version of the Podcast I interviewed Andre, he is an AI expert on IBM and we talked a lot about how to solve problems using AI. Brains website: https://brains.dev/ Andre's Linkedin: https://www.linkedin.com/in/andrefelipelopes/ Brains' Linkedin: https://www.linkedin.com/company/brains-brazilian-ai-networks/ Instagram of the podcast: https://www.instagram.com/podcast.lifewithai Linkedin of the podcast: https://www.linkedin.com/company/life-with-ai...
Feb 22, 2024•16 min
Hey guys, in this episode I talk about Mixture Of Experts, more specifically about Mixtral, which is today the best open-source LLM available, and also better than ChatGPT 3.5 and Gemini Pro. Mixtral paper: https://arxiv.org/pdf/2401.04088.pdf Mixtral model: https://huggingface.co/mistralai/Mixtral-8x7B-v0.1 Mixtral YouTube: https://www.youtube.com/watch?v=mwO6v4BlgZQ Instagram: https://www.instagram.com/podcast.lifewithai Linkedin: https://www.linkedin.com/company/life-with-ai...
Jan 18, 2024•11 min
Hey guys, in this episode I have Sergei as guest and we talked a lot about NLP and Named Entity Recognition (NER). Sergei and his colleagues at NuMind have the current state or the art model for NER and we discussed a lot about it during the episode. English model: https://huggingface.co/numind/generic-entity_recognition_NER-v1 Multi-language model: https://huggingface.co/numind/generic-entity_recognition_NER-multilingual-v1 Sergei's Linkedin: https://www.linkedin.com/in/svbogdanov/ Instag...
Dec 07, 2023•31 min
Hey guys, in this episode I explain most of what I know about Transformers. I talk about the architecture, the attention formula, encoder, decoder, self-supervised learning, positional encoding, tokenization, inductive bias, Vision-Transformers, receptive fields... It was the most technical episode I've recorded so far, and I hope you like it! By the way, it worth listening to this episode with the Transformers paper. Paper Transformers: https://arxiv.org/pdf/1706.03762.pdf Link of OpenAI expl...
Nov 30, 2023•24 min
Hey guys, in this episode I explain RAG (Retrieval Augmented Generation) and the concept of agents executing different tasks. Hope you like it! Instagram: https://www.instagram.com/podcast.lifewithai/ Linkedin: https://www.linkedin.com/company/life-with-ai
Nov 16, 2023•13 min
Hey guys, in the brazilian version of the podcast I interviewed Daniel, CTO of WeClever, a company that uses AI to improve chat bot experience. In the episode we talked about fine tuning ChatGPT, LoRA, RAG and more! WeClever: https://www.linkedin.com/company/wecleverco/ Daniel Linkedin: https://www.linkedin.com/in/dmerlimorais/ Instagram: https://www.instagram.com/podcast.lifewithai/ Linkedin: https://www.linkedin.com/company/life-with-ai...
Nov 02, 2023•9 min
Hey guys, this episode was really great! I have tips for both technical and behavioral interviews. In the technical interview part, I talk about what is always present in interviews and is always good to know! I also talk about the biggest technical mistakes that people make in interviews and explain them. In the behavioral part, I talk a little about the interviewer's vision and what is important to know! Instagram: https://www.instagram.com/podcast.lifewithai/ Linkedin: https://www.linkedin.co...
Aug 31, 2023•15 min