This article provides an overview of Large Language Model (LLM) distillation, a technique for transferring knowledge from a large, powerful "teacher" model to a smaller, more efficient "student" model. It explains the core principles, including the use of soft targets derived from the teacher's probability distributions rather than traditional hard labels, and the role of temperature scaling in softening those distributions to expose more nuanced knowledge. It then surveys distillation techniques such as offline, online, and self-distillation, contrasts response-based and feature-based methods, and breaks down the mechanics of the distillation loss function and its components. A case study on the DeepSeek model family demonstrates how advanced reasoning capabilities can be transferred through synthetic data generation and multi-stage training. Finally, the article addresses hardware infrastructure considerations for distillation, outlining VRAM requirements, GPU recommendations, and a practical roadmap for implementing a custom distillation project.
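To make the core ideas concrete before the detailed sections, here is a minimal sketch (in plain NumPy, not the article's code) of the classic soft-target distillation loss with temperature scaling. The function names, the 1e-12 stabilizers, and the default hyperparameters (`temperature=2.0`, `alpha=0.5`) are illustrative assumptions, not values taken from the article.

```python
import numpy as np

def softmax(logits, temperature=1.0):
    # Dividing logits by the temperature flattens the distribution at T > 1,
    # exposing the teacher's relative preferences over non-target classes.
    z = logits / temperature
    z = z - z.max()  # subtract the max for numerical stability
    e = np.exp(z)
    return e / e.sum()

def distillation_loss(student_logits, teacher_logits, hard_label,
                      temperature=2.0, alpha=0.5):
    """Sketch of a standard distillation objective:
       alpha * T^2 * KL(teacher_soft || student_soft)
       + (1 - alpha) * cross_entropy(student, hard_label).
    The T^2 factor keeps soft-target gradient magnitudes comparable
    across temperature choices."""
    p_teacher = softmax(teacher_logits, temperature)
    p_student = softmax(student_logits, temperature)
    # KL divergence between the softened teacher and student distributions.
    kl = np.sum(p_teacher * (np.log(p_teacher + 1e-12)
                             - np.log(p_student + 1e-12)))
    # Ordinary cross-entropy against the hard label at T = 1.
    ce = -np.log(softmax(student_logits)[hard_label] + 1e-12)
    return alpha * (temperature ** 2) * kl + (1 - alpha) * ce
```

When the student's logits match the teacher's exactly, the KL term vanishes and only the hard-label cross-entropy remains; `alpha` then trades off how much the student imitates the teacher versus fitting the ground-truth labels.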