I'm Zichen Tian
Central South University (CSU) β B.S. in Computer Science, Turing Class
Sept 2021 β June 2025The Chinese University of Hong Kong (CUHK) β M.Sc. in Artificial Intelligence
Aug 2025 β June 2027
Apr 2025 β Sep 2025
- LLM reasoning, Agent, context engineering
Nov 2024 β Mar 2025
- SFT, DPO, workflow
- Large Language Models (LLM): Continued Pre-training, SFT, Preference Alignment (DPO/RFT/GRPO/DAPO)
- Reinforcement Learning (RL): Reward Modeling, MAB, Agent Decision-Making, RLHF/RLAIF
- Agents: Tool Usage, Workflow Orchestration (LangGraph), Multi-Agent Collaboration, Autonomous Planning
- RAG: Multi-path Retrieval, Re-ranking, Knowledge Base Construction, Low-latency Inference
- Languages: Python (Proficient), Java (Proficient), C/C++, Shell, SQL, HTML/CSS/JS, Matlab
- Frameworks: PyTorch, DeepSpeed, vLLM, VeRL, Open-R1, Ray, Scikit-learn
- Tools: Linux, Git, Docker, Spring Boot, Flask, QT, MyBatis-Plus, LangGraph
- Models & Libraries: HuggingFace, ModelScope, FAISS, BM25, BGE/GTE series, VLLM, Alphapose
- π§ Email: [email protected]
- πΌ GitHub: github.com/Togetabetterplace
- π― Interests: LLM Β· Post-training Β· RL Β· Agent
My GitHub name reflects my mission: using tech to build a better world.