Overview
Work History
Education
Skills
Project History
Timeline
Generic

Diyun Lu

AI Engineer
Beijing,BJ

Overview

2
2
years of professional experience
7
7
years of post-secondary education

Work History

AI Engineer

TideSwing Technology Co., Ltd
Beijing, China
07.2020 - Current
  • Responsible for NLP and recommender system related projects.

NLP Engineer Intern

Neusoft Corporation Limited Advanced Product Development
Shenyang, China
04.2018 - 07.2018
  • Worked on the research and development of Intelligent Dialogue Robots - responsible for electronic medical record information processing, training of deep learning model, intention recognition and template matching with natural language processing.
  • Worked in the Project of Virtualized Standard Patient - responsible for electronic medical record information processing, the Front-End Website page processing.

Education

Master of Research - Biomedical Engineering (Computing Route)

University College London
London, England
09.2018 - 09.2019

Visiting Worker - Genetic Data Analysis

University of Cambridge - Wellcome Trust Sanger in
Cambridge, England
08.2018 - 12.2019

Bachelor of Science - Biomedical Engineering (Computing Route)

The Northeastern University
Shenyang, China
09.2014 - 07.2018

Exchange Student - Biomedical Engineering (Computing Route)

University of Silesia
Silesia, Poland
09.2016 - 02.2017

Skills

    Python

undefined

Project History

TideSwing Technology Co., Ltd. related projects 2020.07 - present

Recommender System Related

  • Background:

With the increase in the volume of users and the amount of dynamic submission data, a personalized recommendation system that can better meet the different preferences of different users is required.

  • Job description:

Worked on feature extraction based on business scenarios, and created user and text features through the two-tower model. Independently complete feature filtering, data preprocessing, model training and iterative systems to improve user click-through rates and retention rates.

  • Primary Techniques:

· vector search · ranking models (such as DIN, DIEN) · tree model · comparative learning

NLP Related

  • Background:

To complete the corresponding matching according to the user's posts, it is necessary to parse and process the text, process and analyze the user's intention and published data through multiple dimensions such as word meaning understanding and semantic analysis, and design a more accurate matching mechanism and plan.

  • Job description:

Responsible for the entire project from data collection, data labeling, model training, testing, deployment, and iterative optimization. The project mainly involves: named entity recognition, multi-label classification model, topic model, word segmentation, new word discovery and other NLP tasks.

  • Primary Techniques:

· Bert pretrained model · Fine-tuned · BiLSTM · LDA · CRF


User portrait

  • Background:

According to the user's basic attributes and text information, the user's interests and preferences are obtained, and the user portrait is constructed.

  • Job description:

For items that match user text, it is necessary to recommend more according to the user's personal preference, and complete the user portrait label independently

System construction, label extraction of user data, storage and query of graph database, and regular update, and integrate them into the recommendation system to improve the accuracy.

  • Primary Techniques:

· Label Extraction · Knowledge Graph · Graph Database

Sanger Institue University of Cambridge Prediction of Genetic Essentialism 2019.8 - 2019.12

  • Background:

It is necessary to predict the essentialism of genes in gene expression data by comparing the strategies of different models, which is an important preliminary analysis work for genetic editing related research.

  • Job description:

Worked on the Predictiion of the gene expression data given by the laboratory, and used different machine learning models such as linear regression and lasso regression to predict the essentialism of the corresponding gene.

  • Primary Techniques:

· Genetic Data Preprocessing · Linear Regression · Lasso regression


Combining Imaging Biomarkers and AI on medical records to shed light on Alzheimer's disease

(MRes project) 09.2018 - 08.2019

  • Background:

Data from Alzheimer's disease ADNI was used, including clinical data in the form of medical records, biological specimen data, genetic data, and imaging data. Predicting advanced Alzheimer's disease by analyzing medical records and other biomarker data of different patients.

  • Job Description:

Combine the machine learning and natural language processing techniques on imaging biomarkers and medical records, and reveal Alzheimer's disease through related research to achieve an earlier diagnosis.

  • Primary Techniques:

· LDA Topic modelling · Data processing and analysis with Python

· Deep Learning using LSTM · Medical Knowledge Graph

· Entity Extraction · Intention Recognition

Timeline

AI Engineer

TideSwing Technology Co., Ltd
07.2020 - Current

Master of Research - Biomedical Engineering (Computing Route)

University College London
09.2018 - 09.2019

Visiting Worker - Genetic Data Analysis

University of Cambridge - Wellcome Trust Sanger in
08.2018 - 12.2019

NLP Engineer Intern

Neusoft Corporation Limited Advanced Product Development
04.2018 - 07.2018

Exchange Student - Biomedical Engineering (Computing Route)

University of Silesia
09.2016 - 02.2017

Bachelor of Science - Biomedical Engineering (Computing Route)

The Northeastern University
09.2014 - 07.2018
Diyun LuAI Engineer