Large Language Modelling/GenAI Engineer

Niuwa
Human Resources Management / Consultancy
Posted 3 months ago
HK $45K-68K/Month
Master
5 to 10 years

Send application message

Job DescriptionTranslate to English
<p>Duties include:<br>• Model training and optimization: Design and implement large language model training strategies, including supervised fine-tuning (SFT), reinforcement learning (such as GRPO, PPO), etc., to enhance the model's intelligence in the Web3 field.<br>• Data processing and generation: Build a high-quality training dataset, perform data distillation and Long&Short Chain of Thought (Long&Short Chain of Thought, CoT) data generation to ensure that the model has strong inference abilities.<br>• Model architecture and evaluation: Explore and apply advanced model architectures such as expert mixing (MoE), develop model evaluation frameworks and metrics, and continuously optimize model performance.<br>• Distributed training and deployment: Develop and maintain distributed training schemes for models to ensure efficient training and stable deployment. <br>• Technological frontier exploration: Track the latest research dynamics in the AI field, such as OpenAi GPT-4.5, DeepSeek-R1, etc., and promote technological innovation and application in actual business. <br>Position requirements<br>• Educational background: Bachelor's degree in Computer Science, Artificial Intelligence, Machine Learning or related fields, with preference for Master's or Doctoral degrees.<br>• Technical skills:<br>• Proficient in Transformer architecture, familiar with Transformer Reinforcement Learning (TRL), PyTorch or TensorFlow deep learning-based learning frameworks, etc.<br>• Have large language model fine-tuning experience, familiar with reasoning-oriented reinforcement learning (Reasoning-Oriented Reinforcement Learning, RORL) technology. <br>• Familiar with distributed training frameworks, with practical experience in model parallelism, Flash Attention, LoRA, etc.,<br>• Engineering capability:<br>• Proficient in Python, Go, etc. programming language, with good coding style and software engineering practical experience.<br>• Familiar with model serving technologies such as Triton, vLLM, TGI, etc., those with inference optimization experience are preferred.<br>• Research ability:<br>• Able to read and implement cutting-edge papers, write technical reports or blogs.<br>• Priority will be given to those with papers published or open-source project contributions at top conferences (such as NeurIPS, ICLR, ICML, ACL).<br>• Soft skills:<br>• Have excellent team collaboration and communication skills, and be able to work efficiently with cross-functional teams.<br>• In-depth understanding of open-source AI communities, with contributors to relevant projects being preferred.</p>

Languages
English
Skills
Software Engineering
大模型训练

avatar
avatar
班云
Niuwa · 招聘专家
Active recently

Job Location

環球貿易廣場-West Kowloon Reclamation, Yau Tsim Mong

西九龍 Austin Rd W, 1號環球貿易廣場

Map info not available. You can open it in another map app.

Location

Direction


Be careful

Don’t provide your bank or credit card details when applying for jobs.

Send application message

Similar jobs
View more

3 to 5 years


Bachelor

$45K-65K/Mth

Collaborate on innovative AI solutions with business units

3-6 years AI engineering or related experience required

Proficiency with public cloud platforms (AWS, Azure, AliCloud)

Quick reply

1 to 3 years


Bachelor

$10K-30K/Mth

专注于机器人触觉技术

提供签证协助

嵌入式算法开发经验

靈犀未來

  • Active today

Quick reply

5 to 10 years


Bachelor

$40K-60K/Mth

Quick reply

1 to 3 years


Master

参与构建HKGAI大模型

熟悉C/C++或Python

NLP、CV相关经验优先

1 to 3 years


Bachelor

$18K-30K/Mth

汇纳科技

  • Active recently

1 to 3 years


Master

参与构建HKGAI大模型

熟悉C/C++或Python

大模型领域项目或论文经验优先