Summary

🔹 NLP Data Scientist specializing in Large Language Models (LLM), focusing on model development and deployment, with expertise in fine-tuning, alignment, deployment, and quantization.

🔹 Develops scalable and efficient LLM solutions for e-commerce, leveraging foundational models to enhance buyer experience and improve customer service efficiency.

Years of Experience

2.5 years

Residence

Singapore

Title

Senior Engineer | NLP Data Scientist (LLM)

Work Experience

Shopee

Senior Engineer | NLP Data Scientist (LLM)

August 2022 - Now

Shopee - Marketplace Intelligence - Chatbot

  • As a Senior Engineer, I specialize in LLM fine-tuning, deployment, and optimization to enhance customer interactions and improve chatbot performance in e-commerce. My work covers millions of stores across 8 key markets, leading to improved customer satisfaction, higher order conversion rates, and reduced customer service workload.

Tencent

Recommendation Algorithm Engineer Intern

April 2021 - October 2021

Tencent News - Recommendation Algorithm Center

  • As a Recommendation Algorithm Engineer Intern, I worked on optimizing recommendation models and improving feature selection to enhance content personalization and user engagement.

Projects

LLM Shop Chatbot - Topic Recognition

Senior Engineer | Data Scientist

Automatically identify buyer inquiry intentions using Prompt-Guided multi-classification and model fine-tuning, reducing manual customer service transfers.

LLM Shop Chatbot - Product QA

Engineer | Data Scientist

Combined product detail page text and images for large model fine-tuning and alignment, enhancing multi-modal answering capabilities.

LLM Shop Chatbot - Sales Guide

Engineer | Data Scientist

Parsed user needs through Function Call, invoked product search, and provided accurate recommendations based on Document QA.

News Recommendation Model Optimization

Engineer (Intern)

undefined 2021 - undefined 2021

Optimized the double tower model structure, filtered and added effective features, resulting in a 0.8% increase in click-through rate (CTR).

Skills

  • Programming Languages

    Python (proficient, including experience with PyTorch and TensorFlow)

    SQL (Presto)

    Familiar with Shell, Bash, and common scripting languages

  • Large Model Development

    Model fine-tuning (SFT) & alignment (DPO, KTO)

    Quantization, inference acceleration, multimodal models

    Mainstream open-source LLM frameworks

  • Language Proficiency

    Fluent in Chinese and English, capable of writing technical documents & daily communication

Education

Institute of Automation, Chinese Academy of Sciences (CASIA)

Computer Science and Technology - Master's

2019 - 2022

China University of Mining and Technology

Electronic Information Science and Technology - Bachelor's

2015 - 2019

Languages

Language

Chinese

Mother Tongue

Language

English

Professional