User Papers

#TitleLink
1DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning2501.12948.html
2MangaNinja: Line Art Colorization with Precise Reference Following2501.08332.html
3MiniMax-01: Scaling Foundation Models with Lightning Attention2501.08313.html
4TheAgentCompany: Benchmarking LLM Agents on Consequential Real World Tasks2412.14161.html
5Long Context vs. RAG for LLMs: An Evaluation and Revisits2501.01880.html
6rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking2501.04519.html
7DeepSeek-V3 Technical Report2412.19437.html
8Qwen2.5 Technical Report2412.15115.html
9A Survey on Large Language Model based Autonomous Agents2308.11432.html
10Evaluating and Aligning CodeLLMs on Human Preference2412.05210.html
11Shiksha: A Technical Domain focused Translation Dataset and Model for Indian Languages2412.09025.html
12AgentTrek: Agent Trajectory Synthesis via Guiding Replay with Web Tutorials2412.09605.html
13Phi-4 Technical Report2412.08905.html