Two Minute Papers
Arxiv Machine Learning Papers
- DSO: Aligning 3D Generators with Simulation Feedback for Physical Soundness
- QuestBench: Can LLMs ask the right question to acquire information in reasoning tasks?
- Evaluation of Machine-generated Biomedical Images via A Tally-based Similarity Measure
- Differential equation quantum solvers: engineering measurements to reduce cost
- Tropical Bisectors and Carlini-Wagner Attacks
- Sentiment Classification of Thai Central Bank Press Releases Using Supervised Learning
- Challenges and Paths Towards AI for Software Engineering
- Using Machine Learning for Lunar Mineralogy-I: Hyperspectral Imaging of Volcanic Samples
- Evaluating Multimodal Language Models as Visual Assistants for Visually Impaired Users
- Generative Latent Neural PDE Solver using Flow Matching
Arxiv AI Papers
- DSO: Aligning 3D Generators with Simulation Feedback for Physical Soundness
- Think Before Recommend: Unleashing the Latent Reasoning Power for Sequential Recommendation
- QuestBench: Can LLMs ask the right question to acquire information in reasoning tasks?
- ActionStudio: A Lightweight Framework for Data and Training of Action Models
- Exploring the Effectiveness of Multi-stage Fine-tuning for Cross-encoder Re-rankers
- Evaluation of Machine-generated Biomedical Images via A Tally-based Similarity Measure
- Unicorn: Text-Only Data Synthesis for Vision Language Model Training
- Empirical Analysis of Sim-and-Real Cotraining Of Diffusion Policies For Planar Pushing from Pixels
- Challenges and Paths Towards AI for Software Engineering
- Evaluating Multimodal Language Models as Visual Assistants for Visually Impaired Users
Microsoft AI
- Research Focus: Week of March 24, 2025
- The reality of generative AI in the clinic
- Metasurface: Unlocking the future of wireless sensing and communication
- Claimify: Extracting high-quality claims from language model outputs
- Introducing KBLaM: Bringing plug-and-play external knowledge to LLMs
- Semantic Telemetry: Understanding how users interact with AI systems
- The AI Revolution in Medicine, Revisited: An Introduction
- Advancing biomedical discovery: Overcoming data challenges in precision medicine
- Magma: A foundation model for multimodal AI agents across digital and physical worlds
- Exploring the structural changes driving protein function with BioEmu-1
DeepMind
- Gemini 2.5: Our most intelligent AI model
- Gemini Robotics brings AI into the physical world
- Experiment with Gemini 2.0 Flash native image generation
- Introducing Gemma 3
- Start building with Gemini 2.0 Flash and Flash-Lite
- Gemini 2.0 is now available to everyone
- Updating the Frontier Safety Framework
- FACTS Grounding: A new benchmark for evaluating the factuality of large language models
- State-of-the-art video and image generation with Veo 2 and Imagen 3
- Introducing Gemini 2.0: our new AI model for the agentic era
- Google DeepMind at NeurIPS 2024
Amazon AI
- Amazon Bedrock Guardrails image content filters provide industry-leading safeguards, helping customers block up to 88% of harmful multimodal content: Generally available today
- Integrating custom dependencies in Amazon SageMaker Canvas workflows
- Generate training data and cost-effectively train categorical models with Amazon Bedrock
- Enable Amazon Bedrock cross-Region inference in multi-account environments
- Amazon SageMaker JumpStart adds fine-tuning support for models in a private model hub
- Generative AI-powered game design: Accelerating early development with Stability AI models on Amazon Bedrock
- Amazon Bedrock launches Session Management APIs for generative AI applications (Preview)
- Enhance deployment guardrails with inference component rolling updates for Amazon SageMaker AI inference
- Evaluate and improve performance of Amazon Bedrock Knowledge Bases
- Enhance enterprise productivity for your LLM solution by becoming an Amazon Q Business data accessor
- Build a generative AI enabled virtual IT troubleshooting assistant using Amazon Q Business
Lex Fridman
- Douglas Murray: Putin, Zelenskyy, Trump, Israel, Netanyahu, Hamas & Gaza | Lex Fridman Podcast #463
- Ezra Klein and Derek Thompson: Politics, Trump, AOC, Elon & DOGE | Lex Fridman Podcast #462
- ThePrimeagen: Programming, AI, ADHD, Productivity, Addiction, and God | Lex Fridman Podcast #461
- Narendra Modi: Prime Minister of India - Power, Democracy, War & Peace | Lex Fridman Podcast #460
- DeepSeek, China, OpenAI, NVIDIA, xAI, TSMC, Stargate, and AI Megaclusters | Lex Fridman Podcast #459
What's AI
- How Open-Sora 2.0 Built Sora-Level Video AI for $200K (Full Breakdown)
- AI Explained For Complete Beginners - No Math! Towards AI's Python Primer for Generative AI
- DeepSeek's FlashMLA Explained
- What is Reinforcement Fine-Tuning (RFT) - Supervised vs. RL LLM Re-training
- How LLMs Can be Used as Coding Assistants (Python for AI Beginners Course by Towards AI)
Yannic Kilcher
- [GRPO Explained] DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models
- Traditional Holiday Live Stream
- Byte Latent Transformer: Patches Scale Better Than Tokens (Paper Explained)
- Safety Alignment Should be Made More Than Just a Few Tokens Deep (Paper Explained)
- TokenFormer: Rethinking Transformer Scaling with Tokenized Model Parameters (Paper Explained)