Arxiv AI Papers
- Time Is a Feature: Exploiting Temporal Dynamics in Diffusion Language Models
- Training-Free Text-Guided Color Editing with Multi-Modal Diffusion Transformer
- BrowseMaster: Towards Scalable Web Browsing via Tool-Augmented Programmatic Agent Pair
- OpenCUA: Open Foundations for Computer-Use Agents
- SMA: Who Said That? Auditing Membership Leakage in Semi-Black-box RAG Controlling
- Towards Universal Neural Inference
- SPARC: Soft Probabilistic Adaptive multi-interest Retrieval Model via Codebooks for recommender system
- Dynamic Uncertainty-aware Multimodal Fusion for Outdoor Health Monitoring
- CVCM Track Circuits Pre-emptive Failure Diagnostics for Predictive Maintenance Using Deep Neural Networks
- Can We Trust AI to Govern AI? Benchmarking LLM Performance on Privacy and AI Governance Exams
Arxiv Machine Learning Papers
- Complex Logical Instruction Generation
- Deep Neural Network Calibration by Reducing Classifier Shift with Stochastic Masking
- Constrained free energy minimization for the design of thermal states and stabilizer thermodynamic systems
- Towards Universal Neural Inference
- Bridging Formal Language with Chain-of-Thought Reasoning to Geometry Problem Solving
- Chi-Geometry: A Library for Benchmarking Chirality Prediction of GNNs
- Scaling Up Active Testing to Large Language Models
- Dynamic Uncertainty-aware Multimodal Fusion for Outdoor Health Monitoring
- Meta-learning optimizes predictions of missing links in real-world networks
- VertexRegen: Mesh Generation with Continuous Level of Detail
Two Minute Papers
Microsoft AI
- Dion: the distributed orthonormal update revolution is here
- Reimagining healthcare delivery and public health with AI
- Self-adaptive reasoning for science
- Project Ire autonomously identifies malware at scale
- VeriTrail: Detecting hallucination and tracing provenance in multi-step AI workflows
- Navigating medical education in the era of generative AI
- Xinxing Xu bridges AI research and real-world impact at Microsoft Research Asia – Singapore
- Technical approach for classifying human-AI interactions at scale
- AI Testing and Evaluation: Reflections
- CollabLLM: Teaching LLMs to collaborate with users
Deep Mind
- How AI is helping advance the science of bioacoustics to save endangered species
- Genie 3: A new frontier for world models
- Rethinking how we measure AI intelligence
- Try Deep Think in the Gemini app
- AlphaEarth Foundations helps map our planet in unprecedented detail
- Aeneas transforms how historians connect the past
- Gemini 2.5 Flash-Lite is now ready for scaled production use
- Advanced version of Gemini with Deep Think officially achieves gold-medal standard at the International Mathematical Olympiad
- Exploring the context of online images with Backstory
- AlphaGenome: AI for better understanding the genome
- Gemini Robotics On-Device brings AI to local robotic devices
Amazon AI
- How Amazon scaled Rufus by building multi-node inference using AWS Trainium chips and vLLM
- Build an intelligent financial analysis agent with LangGraph and Strands Agents
- Amazon Bedrock AgentCore Memory: Building context-aware agents
- Build a conversational natural language interface for Amazon Athena queries using Amazon Nova
- Train and deploy AI models at trillion-parameter scale with Amazon SageMaker HyperPod support for P6e-GB200 UltraServers
- How Indegene’s AI-powered social intelligence for life sciences turns social media conversations into insights
- Unlocking enhanced legal document review with Lexbe and Amazon Bedrock
- Automate AIOps with SageMaker Unified Studio Projects, Part 2: Technical implementation
- Automate AIOps with Amazon SageMaker Unified Studio projects, Part 1: Solution architecture
- Demystifying Amazon Bedrock Pricing for a Chatbot Assistant
- Fine-tune OpenAI GPT-OSS models on Amazon SageMaker AI using Hugging Face libraries