
 About 606 jobs

1 to 3 years


Master

$40K-55K/Mth

Design and develop Large Language Model (LLM) training platform

Master's degree in Computer Science, Communications, Electronics or related

At least five years of solid experience in the MLOps field at supervisory level

No experience limit


No degree required

$30K-45K/Mth

Highly competitive remuneration package

Experience in AI training platform/HPC operations preferred

Bachelor's degree in Computer Science, Communications, Electronics or related field required

1 to 3 years


Higher Diploma or Associate Degree

$35K-45K/Mth

Manage and support large-scale enterprise servers

Higher Diploma/Degree (IT-related preferred)

Strong hands-on RHEL/CentOS skills

Grey Anderson Limited

  • Active within 3 days

New

Active Recruiter

No experience limit


No degree required

$30K-35K/Mth

Expert-level on-premises VDI support

Minimum 6 years professional VDI experience

Proficiency in Citrix or Sangfor solutions

Classy Wheeler

  • Active today

1 to 3 years


Master

Proficient in C++ or Python

Familiar with the Robot Operating System (ROS/ROS2)

Experience developing teleoperation systems with force feedback preferred

New

Active Recruiter

No experience limit


Bachelor

$35K-65K/Mth

No experience limit


Bachelor

Assist in developing front-end interfaces and back-end logic

Support firmware development, implementing Bootloader functions

Bachelor's degree in Computer Science/Electronics/Automation, proficient in C/C++

No experience limit


No degree required

$8K-15K/Mth

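Responsive web and mobile app development

1-3 years of digital development experience; familiar with MVC frameworks

Excellent written English and Chinese; Cantonese preferred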

JUSTONEGALAXY

  • Active today

No experience limit


Bachelor

$18K-27K/Mth

Experience in Linux environment development, skilled in Shell scripting

Bachelor's degree in Computer Science/Electronics/Automation, proficient in C/C++

Project experience in embedded systems, knowledge of RTOS or Linux kernel

3 to 5 years


Bachelor

$23K-35K/Mth

Research AI-driven solutions for operational efficiency

Experience in AI, machine learning, or data analytics projects.

Bachelor’s or Master’s degree in Computer Science, AI, Data Science

LocoBike

  • Active today

MLOps R&D Engineer

The Hong Kong Polytechnic University Academy of Advanced Artificial Intelligence (PAAI)
Education & Research
Updated within 3 months
HK $40K-55K/Month
Master
1 to 3 years
Full Time
9.0 hrs/day, 5 days/wk

Job Description

Duties

The appointee will be required to work for one of the constituent research units – Research Institute for Generative AI (RIGAI) (to be established) under the PolyU Academy for Artificial Intelligence (PAAI). The appointee will be required to:

(a) take responsibility for the design and development of a Large Language Model (LLM) training platform, developing unified capabilities for GPU resource pooling, training job scheduling, inference acceleration and the Machine Learning Operations (MLOps) platform to support efficient model training iteration;

(b) lead the construction of the GPU computing cluster centered on Kubernetes and the NVIDIA GPU Operator, including node planning, resource management, scheduling policies and container runtime environment setup (Docker/Containerd);

(c) build the software stack for the NVIDIA cluster, including CUDA, NVIDIA drivers, Fabric Manager, PyTorch Distributed and NCCL communication, to ensure high performance and stability for distributed training;

(d) design and implement critical infrastructure components and toolchains for the training platform, including training task orchestration and automated pipelines, unified base image system (CUDA + PyTorch), data loading and data distribution components, and training artifact management and model version management;

(e) collaborate with the LLM team to support the implementation, optimisation and efficiency improvement of distributed training for framework layers (PyTorch Distributed, Megatron, SGLang) on the platform;

(f) participate in building the monitoring and observability system, covering GPU metrics, NCCL communication, IB network, storage I/O and Pod runtime status, and establish alerting strategies;

(g) write platform construction documentation, development specifications, and automation scripts and tooling (Python/Go/Bash/Terraform) to enhance engineering consistency and delivery quality; and

(h) perform any other duties as assigned by the Director of PAAI or his delegates.

Qualifications

Applicants should:

(a) have a master’s degree or above in Computer Science, Communications, Electronics or a related discipline;

(b) have at least five years of solid experience in the MLOps field at supervisory level;

(c) have a basic understanding of LLM training processes, multimodal models and AI Agents;

(d) be familiar with the overall training, inference and evaluation pipeline;

(e) be proficient in mainstream languages such as Python or Go with good engineering skills, coding standards and backend development capabilities;

(f) be familiar with LLM-related training frameworks such as PyTorch, PyTorch Distributed, SGLang and Megatron;

(g) have knowledge of Kubernetes and its GPU scheduling ecosystem, including GPU Operator, container runtime, image building and pipeline engineering processes;

(h) be familiar with NVIDIA Hopper GPU architecture, NCCL communication, InfiniBand network, GPU/NVLink topology and performance bottlenecks;

(i) have knowledge of HDFS, JuiceFS, GPFS or similar large-scale data access systems, and an understanding of training data reading bottlenecks;

(j) preferably have experience in foundational infrastructure technologies such as Ray, message queues, backend storage and API services;

(k) preferably have experience in platform engineering, training platform development, MLOps or distributed systems development;

(l) be capable of translating model team requirements into engineered solutions;

(m) have good communication skills; and

(n) have a good command of both written and spoken English and Chinese.

Applicants with less supervisory experience will be considered for the post of Engineer.


Languages
Cantonese
English
Mandarin
Skills
IT Infrastructure

HR WU
The Hong Kong Polytechnic University Academy of Advanced Artificial Intelligence (PAAI) · HR
Active within 3 days

Job Location

The Hong Kong Polytechnic University, Hung Hom, Kowloon City

11 Yuk Choi Road, Hung Hom


Company Overview

The Hong Kong Polytechnic University (PolyU) is a public applied research university located in Hung Hom, Kowloon, Hong Kong. Its predecessor, the Government Trade School, was established in 1937. After several stages of development, the institution gained university status in 1994 and became one of the eight universities funded by the University Grants Committee (UGC). PolyU is one of Hong Kong's top universities and is ranked among the world's top 100 universities in three global rankings: QS World University Rankings, Times Higher Education World University Rankings (THE), and U.S. News & World Report's Best Global Universities (U.S. News). It is governed by The Hong Kong Polytechnic University Ordinance (Cap. 1075).

The PolyU Academy for Artificial Intelligence (PAAI) is affiliated with PolyU and was established on April 1, 2025. Its inauguration ceremony was officiated by the Secretary for Innovation, Technology and Industry, Sun Dong, and the President of the University, Jin-Guang Teng. The academy brings together computer science, mathematics and data science within the university, with the aim of strengthening international cooperation and helping to build Hong Kong into an AI innovation hub.

