Based in Guangzhou.
Key Responsibilities:
- Lead End-to-End Development: Architect, build, and deploy our core Intelligent Document Processing platform and AI services.
- Apply Advanced AI: Utilize both traditional OCR and modern Visual Language Models (VLM) to extract insights from complex, large-volume documents.
- Drive Optimization: Ensure the scalability, reliability, and efficiency of our AI solutions in a production environment.
- Collaborate & Guide: Work closely with cross-functional teams and mentor junior team members to foster innovation.
We're Looking For: 8+ years of experience in AI/Machine Learning and Software Engineering, with a proven record of delivering complex projects.
Technical Must-Haves:
- Proficiency in Python and a strong grasp of software engineering practices (CI/CD, DevOps, Kubernetes).
- Hands-on experience with OCR (e.g., Tesseract) and Visual Language Models (VLM) (e.g., Qwen VL, Llama Vision).
- Solid background in NLP, ML, LLM fine-tuning, and database technologies (MongoDB, PostgreSQL, Elasticsearch).
- Experience in AI-powered document automation (classification, information extraction) is essential.
- Excellent problem-solving and communication skills, with the ability to thrive in a collaborative setting.