Deli Chen

陈德里

Senior Researcher at DeepSeek AI. Building next-generation large language models.
Core contributor to DeepSeek-V1, V2, V3, V4, DeepSeek-R1 (Nature Cover), DeepSeek-Coder, DeepSeek-MoE architecture, etc.
B.S. & M.S. from Peking University · Previously Tencent WeChat AI.

All opinions are my own. | INTP-T | "The human mind is precarious; the mind of the Way is subtle" (人心惟危，道心惟微) | #AGIforEveryone

Citations: 23,500+

h-index: 21

Papers: 29+

Experience

2023 - Present

Senior Researcher, DeepSeek AI

Core contributor to DeepSeek-V1, V2, V3, V4, DeepSeek-R1 (Nature Cover), DeepSeek-Coder, DeepSeek-MoE architecture, etc. Public spokesperson at NVIDIA GTC 2024 and World Internet Conference 2025.

2021 - 2023

Researcher, Tencent WeChat AI

NLP and language model research.

2019 - 2021

M.S. in Computer Science, Peking University

MOE Key Lab of Computational Linguistics (LancoPKU). Advisor: Prof. Xu Sun. Research on GNN, NLP, and financial AI.

2015 - 2019

B.S. in Information Management, Peking University

School of Information Management.

Research

🧠

Large Language Models

DeepSeek series: scaling, MoE architecture, efficient training.

💡

Reasoning & RL

RL for LLM reasoning (DeepSeek-R1), step-by-step verification.

🕸️

Graph Neural Networks

Over-smoothing, topology-imbalance, contrastive learning.

🔒

LLM Safety & Alignment

Backdoor detection, watermarking, diffusion purification.

📈

Financial NLP

Stock prediction with event graphs, forex news aggregation.

🔍

Interpretability

In-context learning: label words as anchors.

Selected Publications

DeepSeek-V4 Technical Report

2025 · New

—

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Nature 2025 (Cover Article)

9,541

DeepSeek-V3 Technical Report

arXiv 2024

4,600

Measuring and Relieving the Over-smoothing Problem for Graph Neural Networks from the Topological View

AAAI 2020

1,748

DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models

ACL 2024

1,017

Math-Shepherd: Verify and Reinforce LLMs Step-by-Step without Human Annotations

ACL 2024

993

DeepSeek LLM: Scaling Open-Source Language Models with Longtermism

arXiv 2024

952

DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model

arXiv 2024

926

DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence

arXiv 2024

467

Label Words are Anchors: An Information Flow Perspective for Understanding In-Context Learning

EMNLP 2023 · Best Long Paper

284

DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models

arXiv 2025

281

Modeling the Stock Relation with Graph Network for Overnight Stock Movement Prediction

IJCAI 2021

281

Topology-Imbalance Learning for Semi-Supervised Node Classification

NeurIPS 2021

173

Towards Codable Text Watermarking for Large Language Models

ICLR 2023

139

Fed-FA: Theoretically Modeling Client Data Divergence for Federated Language Backdoor Defense

NeurIPS 2023

CascadeBERT: Accelerating Inference of Pre-trained Language Models via Calibrated Complete Models Cascade

Findings of EMNLP 2021

72

Diffusion Theory as a Scalpel: Detecting and Purifying Poisonous Dimensions in Pre-trained Language Models

Findings of ACL 2023

17

Talks & Appearances

NVIDIA GTC 2024

Invited talk: "Harmony in Diversity: Decoupled Value Alignment for LLMs"

World Internet Conference 2025

Wuzhen Summit, "Six Little Dragons Dialogue": representing DeepSeek on the future of open-source AI.

News & Highlights

DeepSeek-R1 published as Nature Cover Article

Sep 2025 · Nature Cover

Vol. 645

"Label Words are Anchors" wins EMNLP 2023 Best Long Paper

Dec 2023 · Best Long Paper

DeepSeek-R1 crosses 9,500 citations in 4 months

May 2025

Lab & Collaborators

Research Lab

LancoPKU Lab

Language Computing & Web Mining, Peking University

Advisor

Xu Sun

Professor, Peking University

DeepSeek AI

Damai Dai

DeepSeek AI / Peking University

Runxin Xu

DeepSeek AI / Peking University

Zhihong Shao

DeepSeek AI / Tsinghua University

Peiyi Wang

DeepSeek AI / Peking University

Academia

Associate Professor, Renmin University

Institute for AI Industry Research, Tsinghua

Qi Su

Associate Professor, Peking University

Renmin University of China

Industry

Beijing Language and Culture University

Ruihan Bao

Mizuho Securities Co., Ltd.

Blog

How Code Agents Saved My INTP Brain (代码 Agent 如何拯救了我的 INTP 大脑)

April 2026 · Popular

A candid reflection on how Code Agents broke the classic INTP overthink-never-execute cycle, the new bottlenecks they reveal, and 9 survival rules for using them without losing your mind. EN/CN bilingual.

Never Stop Learning: A Survey of Continual Learning and Self-Iteration in Large Language Models

May 29, 2026 · New

The first unified survey bridging continual learning and self-improvement for LLMs. Proposes a three-dimensional taxonomy (what/how/when) covering 100+ papers on knowledge updates, skill acquisition, alignment drift, and test-time adaptation. Includes a reproducible benchmark comparing 12 methods across 5 scenarios.

A Survey on LLM-based Automated Research

May 26, 2026 · New

A comprehensive survey exploring how large language models are transforming the scientific research pipeline — from literature review and hypothesis generation to experiment design and paper writing. This work covers the latest advances in automated research agents and discusses the opportunities and challenges ahead.

Chill Projects

Side projects for fun: little things built while slacking off.

文明六百科图鉴 (Civilization VI Wiki)

2026 · Interactive

A comprehensive Civ6 encyclopedia featuring 35+ civilizations, game mechanics, victory strategies, mod guides (BBG/和而不同), and an AI advisor powered by DeepSeek. Fully searchable with interactive comparison tools.