Recent Posts

[논문리뷰] Judging with Confidence: Calibrating Autoraters to Preference Distributions

이 [arXiv]에 게시한 'Judging with Confidence: Calibrating Autoraters to Preference Distributions' 논문에 대한 자세한 리뷰입니다.

2025년 10월 7일

[논문리뷰] Imperceptible Jailbreaking against Large Language Models

이 [arXiv]에 게시한 'Imperceptible Jailbreaking against Large Language Models' 논문에 대한 자세한 리뷰입니다.

2025년 10월 7일

[논문리뷰] Hybrid Architectures for Language Models: Systematic Analysis and Design Insights

이 [arXiv]에 게시한 'Hybrid Architectures for Language Models: Systematic Analysis and Design Insights' 논문에 대한 자세한 리뷰입니다.

2025년 10월 7일

[논문리뷰] HiKE: Hierarchical Evaluation Framework for Korean-English Code-Switching Speech Recognition

이 [arXiv]에 게시한 'HiKE: Hierarchical Evaluation Framework for Korean-English Code-Switching Speech Recognition' 논문에 대한 자세한 리뷰입니다.

2025년 10월 7일

[논문리뷰] Graph2Eval: Automatic Multimodal Task Generation for Agents via Knowledge Graphs

Zeyi Liao이 [arXiv]에 게시한 'Graph2Eval: Automatic Multimodal Task Generation for Agents via Knowledge Graphs' 논문에 대한 자세한 리뷰입니다.

2025년 10월 7일

[논문리뷰] Good Intentions Beyond ACL: Who Does NLP for Social Good, and Where?

Denis Peskoff이 [arXiv]에 게시한 'Good Intentions Beyond ACL: Who Does NLP for Social Good, and Where?' 논문에 대한 자세한 리뷰입니다.

2025년 10월 7일

[논문리뷰] Front-Loading Reasoning: The Synergy between Pretraining and Post-Training Data

이 [arXiv]에 게시한 'Front-Loading Reasoning: The Synergy between Pretraining and Post-Training Data' 논문에 대한 자세한 리뷰입니다.

2025년 10월 7일

[논문리뷰] Factuality Matters: When Image Generation and Editing Meet Structured Visuals

Boxiang Qiu이 [arXiv]에 게시한 'Factuality Matters: When Image Generation and Editing Meet Structured Visuals' 논문에 대한 자세한 리뷰입니다.

2025년 10월 7일

[논문리뷰] EvolProver: Advancing Automated Theorem Proving by Evolving Formalized Problems via Symmetry and Difficulty

Xuanwu Wang이 [arXiv]에 게시한 'EvolProver: Advancing Automated Theorem Proving by Evolving Formalized Problems via Symmetry and Difficulty' 논문에 대한 자세한 리뷰입니다.

2025년 10월 7일

[논문리뷰] Epistemic Diversity and Knowledge Collapse in Large Language Models

이 [arXiv]에 게시한 'Epistemic Diversity and Knowledge Collapse in Large Language Models' 논문에 대한 자세한 리뷰입니다.

2025년 10월 7일

[논문리뷰] Code4MeV2: a Research-oriented Code-completion Platform

이 [arXiv]에 게시한 'Code4MeV2: a Research-oriented Code-completion Platform' 논문에 대한 자세한 리뷰입니다.

2025년 10월 7일

[논문리뷰] ChronoEdit: Towards Temporal Reasoning for Image Editing and World Simulation

이 [arXiv]에 게시한 'ChronoEdit: Towards Temporal Reasoning for Image Editing and World Simulation' 논문에 대한 자세한 리뷰입니다.

2025년 10월 7일

[논문리뷰] Character Mixing for Video Generation

이 [arXiv]에 게시한 'Character Mixing for Video Generation' 논문에 대한 자세한 리뷰입니다.

2025년 10월 7일

[논문리뷰] Alignment Tipping Process: How Self-Evolution Pushes LLM Agents Off the Rails

Xinyuan Liu이 [arXiv]에 게시한 'Alignment Tipping Process: How Self-Evolution Pushes LLM Agents Off the Rails' 논문에 대한 자세한 리뷰입니다.

2025년 10월 7일

[논문리뷰] Agentic Context Engineering: Evolving Contexts for Self-Improving Language Models

Fenglu Hong이 [arXiv]에 게시한 'Agentic Context Engineering: Evolving Contexts for Self-Improving Language Models' 논문에 대한 자세한 리뷰입니다.

2025년 10월 7일

[논문리뷰] AdvEvo-MARL: Shaping Internalized Safety through Adversarial Co-Evolution in Multi-Agent Reinforcement Learning

Zeliang Zhang이 [arXiv]에 게시한 'AdvEvo-MARL: Shaping Internalized Safety through Adversarial Co-Evolution in Multi-Agent Reinforcement Learning' 논문에 대한 자세한 리뷰입니다.

2025년 10월 7일

[논문리뷰] Your Agent May Misevolve: Emergent Risks in Self-evolving LLM Agents

Boyi Wei이 [arXiv]에 게시한 'Your Agent May Misevolve: Emergent Risks in Self-evolving LLM Agents' 논문에 대한 자세한 리뷰입니다.

2025년 10월 6일

[논문리뷰] WAInjectBench: Benchmarking Prompt Injection Detections for Web Agents

Neil Zhenqiang Gong이 [arXiv]에 게시한 'WAInjectBench: Benchmarking Prompt Injection Detections for Web Agents' 논문에 대한 자세한 리뷰입니다.

2025년 10월 6일

[논문리뷰] Triangle Splatting+: Differentiable Rendering with Opaque Triangles

Matheus Gadelha이 [arXiv]에 게시한 'Triangle Splatting+: Differentiable Rendering with Opaque Triangles' 논문에 대한 자세한 리뷰입니다.

2025년 10월 6일

[논문리뷰] TalkPlay-Tools: Conversational Music Recommendation with LLM Tool Calling

Juhan Nam이 [arXiv]에 게시한 'TalkPlay-Tools: Conversational Music Recommendation with LLM Tool Calling' 논문에 대한 자세한 리뷰입니다.

2025년 10월 6일