Duties include:
• Model training and optimization: Design and implement large language model training strategies, including supervised fine-tuning (SFT), reinforcement learning (such as GRPO, PPO), etc., to enhance the model's intelligence in the Web3 field.
• Data processing and generation: Build a high-quality training dataset, perform data distillation and Long&Short Chain of Thought (Long&Short Chain of Thought, CoT) data generation to ensure that the model has strong inference abilities.
• Model architecture and evaluation: Explore and apply advanced model architectures such as expert mixing (MoE), develop model evaluation frameworks and metrics, and continuously optimize model performance.
• Distributed training and deployment: Develop and maintain distributed training schemes for models to ensure efficient training and stable deployment.
• Technological frontier exploration: Track the latest research dynamics in the AI field, such as OpenAi GPT-4.5, DeepSeek-R1, etc., and promote technological innovation and application in actual business.
Position requirements
• Educational background: Bachelor's degree in Computer Science, Artificial Intelligence, Machine Learning or related fields, with preference for Master's or Doctoral degrees.
• Technical skills:
• Proficient in Transformer architecture, familiar with Transformer Reinforcement Learning (TRL), PyTorch or TensorFlow deep learning-based learning frameworks, etc.
• Have large language model fine-tuning experience, familiar with reasoning-oriented reinforcement learning (Reasoning-Oriented Reinforcement Learning, RORL) technology.
• Familiar with distributed training frameworks, with practical experience in model parallelism, Flash Attention, LoRA, etc.,
• Engineering capability:
• Proficient in Python, Go, etc. programming language, with good coding style and software engineering practical experience.
• Familiar with model serving technologies such as Triton, vLLM, TGI, etc., those with inference optimization experience are preferred.
• Research ability:
• Able to read and implement cutting-edge papers, write technical reports or blogs.
• Priority will be given to those with papers published or open-source project contributions at top conferences (such as NeurIPS, ICLR, ICML, ACL).
• Soft skills:
• Have excellent team collaboration and communication skills, and be able to work efficiently with cross-functional teams.
• In-depth understanding of open-source AI communities, with contributors to relevant projects being preferred.