ArXiv: cat:cs.AI

Active

cat:cs.AI

Linked items: 170
Errors: 60
Last fetched: 2026-05-14 00:06
Priority: P1

Edit

Human-friendly label. Leave blank to fall back to the identifier.

Drives ingestion cadence and the default ranking weight applied to items from this source.

How often the watch loop polls this source. Minimum 5 min. Leave blank for the per-type default.

Optional. Pin a specific summarization model for items from this source.
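
The label fallback and poll-interval rules described above can be sketched as follows. This is a minimal illustration, not the application's actual code: the function names `display_label` and `effective_poll_interval` are hypothetical, the per-type default is passed in because its value is not stated here, and rejecting (rather than silently clamping) intervals below the 5-minute minimum is an assumption.

```python
from datetime import timedelta
from typing import Optional

# The form text states a 5-minute minimum for the poll interval.
MIN_POLL_INTERVAL = timedelta(minutes=5)


def display_label(label: Optional[str], identifier: str) -> str:
    """Return the human-friendly label, or the identifier if the label is blank."""
    if label and label.strip():
        return label.strip()
    return identifier


def effective_poll_interval(
    configured: Optional[timedelta],
    per_type_default: timedelta,
) -> timedelta:
    """Resolve the interval the watch loop would use for a source.

    A blank (None) setting falls back to the per-type default.
    Assumption: values under the minimum are rejected, not clamped.
    """
    if configured is None:
        return per_type_default
    if configured < MIN_POLL_INTERVAL:
        raise ValueError("poll interval must be at least 5 minutes")
    return configured
```

For example, `display_label("", "cat:cs.AI")` yields the identifier `cat:cs.AI`, and `effective_poll_interval(None, timedelta(hours=1))` yields the one-hour default supplied by the caller.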

Recent items

The 15 most recently ingested items from this source, newest first.

Title | Type | Fetched
AlphaGRPO: Unlocking Self-Reflective Multimodal Generation in UMMs via Decompositional Verifiable Reward | article | 2026-05-12
Learning, Fast and Slow: Towards LLMs That Adapt Continually | article | 2026-05-12
Beyond GRPO and On-Policy Distillation: An Empirical Sparse-to-Dense Reward Principle for Language-Model Post-Training | article | 2026-05-12
ToolCUA: Towards Optimal GUI-Tool Path Orchestration for Computer Use Agents | article | 2026-05-12
OmniNFT: Modality-wise Omni Diffusion Reinforcement for Joint Audio-Video Generation | article | 2026-05-12
Reward Hacking in Rubric-Based Reinforcement Learning | article | 2026-05-12
KV-Fold: One-Step KV-Cache Recurrence for Long-Context Inference | article | 2026-05-12
Solve the Loop: Attractor Models for Language and Reasoning | article | 2026-05-12
Towards Affordable Energy: A Gymnasium Environment for Electric Utility Demand-Response Programs | article | 2026-05-12
Enabling AI-Native Mobility in 6G: A Real-World Dataset for Handover, Beam Management, and Timing Advance | article | 2026-05-12
EmambaIR: Efficient Visual State Space Model for Event-guided Image Reconstruction | article | 2026-05-08
VecCISC: Improving Confidence-Informed Self-Consistency with Reasoning Trace Clustering and Candidate Answer Selection | article | 2026-05-08
Flow-OPD: On-Policy Distillation for Flow Matching Models | article | 2026-05-08
Rubric-Grounded RL: Structured Judge Rewards for Generalizable Reasoning | article | 2026-05-08
The Memory Curse: How Expanded Recall Erodes Cooperative Intent in LLM Agents | article | 2026-05-08