527 Episodes

  1. Training Agents Inside of Scalable World Models

    Published: 8.10.2025
  2. Small Language Models are the Future of Agentic AI

    Published: 7.10.2025
  3. Activation Steering in Generative Settings via Contrastive Causal Mediation Analysis

    Published: 6.10.2025
  4. Eliciting Secret Knowledge from Language Models

    Published: 6.10.2025
  5. Temporal Difference Flow

    Published: 6.10.2025
  6. Personalized Reasoning: Just-in-Time Personalization and Why LLMs Fail at It

    Published: 5.10.2025
  7. Prompt Curriculum Learning for Efficient LLM Post-Training

    Published: 5.10.2025
  8. Personalizing Reinforcement Learning from Human Feedback with Variational Preference Learning

    Published: 4.10.2025
  9. Enhancing Personalized Multi-Turn Dialogue with Curiosity Reward

    Published: 4.10.2025
  10. Learning to summarize user information for personalized reinforcement learning from human feedback

    Published: 4.10.2025
  11. Distributional Preference Learning: Understanding and Accounting for Hidden Context in RLHF

    Published: 3.10.2025
  12. LIMI: Less is More for Agency

    Published: 1.10.2025
  13. LoRA Without Regret

    Published: 1.10.2025
  14. Actor-Critic without Actor: Critic-Guided Denoising for RL

    Published: 29.9.2025
  15. DELTA-Code: How Does RL Unlock and Transfer New Programming Algorithms in LLMs?

    Published: 29.9.2025
  16. Linear Transformers Implicitly Discover Unified Numerical Algorithms

    Published: 29.9.2025
  17. Regularizing Extrapolation in Causal Inference

    Published: 27.9.2025
  18. DoubleGen: Debiased Generative Modeling of Counterfactuals

    Published: 27.9.2025
  19. What Characterizes Effective Reasoning? Revisiting Length, Review, and Structure of CoT

    Published: 27.9.2025
  20. Compute as Teacher: Turning Inference Compute Into Reference-Free Supervision

    Published: 27.9.2025

Cut through the noise. We curate and break down the most important AI papers so you don’t have to.