Best AI papers explained
Podcast by Enoch H. Kang

183 Episodes
-
Test-Time RL: Self-Evolving LLMs via Majority Voting Rewards
Published: 25.4.2025 -
Tina: Tiny LoRA Reasoning Models
Published: 25.4.2025 -
Evaluating large language models in theory of mind tasks
Published: 25.4.2025 -
QUEST: Quality Sampling for Machine Translation
Published: 24.4.2025 -
Offline Preference Learning via Simulated Trajectory Feedback
Published: 24.4.2025 -
Reasoning Elicitation in Language Models via Counterfactual Feedback
Published: 24.4.2025 -
Eliciting Human Preferences with Language Models
Published: 24.4.2025 -
Sub-Optimal Data for Human-in-the-Loop Reinforcement Learning
Published: 24.4.2025 -
γ-Bench: Evaluating LLMs in Multi-Agent Games
Published: 24.4.2025 -
DRAFT: Self-Driven LLM Tool Mastery via Documentation Refinement
Published: 24.4.2025 -
Optimal Prediction Sets for Enhanced Human-AI Accuracy
Published: 24.4.2025 -
Self-Correction via Reinforcement Learning for Language Models
Published: 24.4.2025 -
Tractable Multi-Agent Reinforcement Learning through Behavioral Economics
Published: 24.4.2025 -
Trust or Escalate: LLM Judges with Provable Guarantees for Human Agreement
Published: 24.4.2025 -
Iterative Nash Policy Optimization for Language Model Alignment
Published: 24.4.2025 -
SycEval: Benchmarking LLM Sycophancy in Mathematics and Medicine
Published: 23.4.2025 -
Stack AI: Democratizing Enterprise AI Development
Published: 22.4.2025 -
Evaluating Modern Recommender Systems: Challenges and Future Directions
Published: 22.4.2025 -
AI in the Enterprise: Seven Lessons from Frontier Companies by OpenAI
Published: 22.4.2025 -
Discussion: Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model?
Published: 21.4.2025
Men know other men best. Women know other women best. And yes, perhaps AIs know other AIs best. AI explains what you should know about this week's AI research progress.