Summary

šŸ”¹ NLP Data Scientist specializing in Large Language Models (LLM), focusing on model development and deployment, with expertise in fine-tuning, alignment, deployment, and quantization.

šŸ”¹ Develops scalable and efficient LLM solutions for e-commerce, leveraging foundational models to enhance buyer experience and improve customer service efficiency.

Years of Experience

2.5 years

Residence

Singapore

Title

Senior Engineer | NLP Data Scientist (LLM)

Skills

  • Programming Languages

    Python (proficient, including experience with PyTorch and TensorFlow)

    SQL (Presto)

    Familiar with Shell, Bash, and common scripting languages

  • Large Model Development

    Model fine-tuning (SFT) & alignment (DPO, KTO)

    Quantization, inference acceleration, multimodal models

    Mainstream open-source LLM frameworks

  • Language Proficiency

    Fluent in Chinese and English, capable of writing technical documents & daily communication

Education

  • Institute of Automation, Chinese Academy of Sciences (CASIA) - Computer Science and Technology
  • China University of Mining and Technology - Electronic Information Science and Technology

Work Experience

  • Shopee
  • Tencent

CV Summary

4
Projects
2
Companies
2
Languages