Sources · arXiv

ArXiv: cat:cs.AI

Active

cat:cs.AI

Linked items 170

Errors 60

Last fetched 2026-05-14 00:06

Priority P1

Recent items

The 15 most-recently-ingested items from this source, newest first.

Title	Type	Fetched
AlphaGRPO: Unlocking Self-Reflective Multimodal Generation in UMMs via Decompositional Verifiable Reward	article	2026-05-12
Learning, Fast and Slow: Towards LLMs That Adapt Continually	article	2026-05-12
Beyond GRPO and On-Policy Distillation: An Empirical Sparse-to-Dense Reward Principle for Language-Model Post-Training	article	2026-05-12
ToolCUA: Towards Optimal GUI-Tool Path Orchestration for Computer Use Agents	article	2026-05-12
OmniNFT: Modality-wise Omni Diffusion Reinforcement for Joint Audio-Video Generation	article	2026-05-12
Reward Hacking in Rubric-Based Reinforcement Learning	article	2026-05-12
KV-Fold: One-Step KV-Cache Recurrence for Long-Context Inference	article	2026-05-12
Solve the Loop: Attractor Models for Language and Reasoning	article	2026-05-12
Towards Affordable Energy: A Gymnasium Environment for Electric Utility Demand-Response Programs	article	2026-05-12
Enabling AI-Native Mobility in 6G: A Real-World Dataset for Handover, Beam Management, and Timing Advance	article	2026-05-12
EmambaIR: Efficient Visual State Space Model for Event-guided Image Reconstruction	article	2026-05-08
VecCISC: Improving Confidence-Informed Self-Consistency with Reasoning Trace Clustering and Candidate Answer Selection	article	2026-05-08
Flow-OPD: On-Policy Distillation for Flow Matching Models	article	2026-05-08
Rubric-Grounded RL: Structured Judge Rewards for Generalizable Reasoning	article	2026-05-08
The Memory Curse: How Expanded Recall Erodes Cooperative Intent in LLM Agents	article	2026-05-08