Contact

Aspiring Data Scientist

Skills

Programming Skills: R, Python, Matlab, SAS(Base/Advanced Certificates), SQL, LaTex;

Machine Learning: Clustering, Association, Classification, Regression, Bayesian Regression;

AWS: EC2, RDS, S3, SageMaker

Education

Rutgers University: M.S., Computer Science, 3.87 (2017 - 2019)

  • Massive Data Storage, Retrieval and Deep Learning(CS 543)
  • Natural Language Processing(CS 533)
  • Pattern Recognition(CS 535)
  • Machine Learning(CS 536)
  • Artificial Intelligence(CS 520)
  • Data Structure and Algorithms(CS 512)
  • Operating Systems(CS 518)
  • Numerical Analysis(CS 510)
  • Principle of Information and Data Management(CS 336)

Rutgers University: M.S., Statistics, 3.85/4.0 (2015 - 2017)

Statistics(Data Mining Track): MASTER OF SCIENCE - MAY 2017

  • Probability(STAT 580)
  • Linear Regression(STAT 563)
  • Statistical Inference(STAT 583)
  • Applied Multivariate Analysis(STAT 567)
  • Interpretation of Data I (STAT 586)
  • Interpretation of Data II (STAT 587)
  • Data Mining(STAT 588)

Course Projects:

  • Community Ride Share System(HTML, JSP, MySQL, AWS)
  • National Institute of Justice Crime Forecasting(Python)
  • Author Prediction from Tweets(R)
  • Discover Region Functionality in A City(Python)
  • Fast Trajectory Replanning(Matlab: Repeated Forward/Backward A, Adaptive A algorithms )
  • Image and Text Classification(Matlab: Naive Bayes, Perceptron, KNN, SVM)

University of Wisconsin-Madison: Non-degree, Mathematics, 4.0 (2014 Fall)

  • Math 340 Elementary Matrix and Linear Algebra
  • Math 421 The Theory of single variable Calculus

Peking University

  • B.S./M.S./Ph.D, Pharmaceutical Science

Working Experience

Analytics Engineer(Statistician II), Process Innovation

Ortho Clinical Diagnostics (November 2017 - current), New Jersey, USA

. Advanced Analytics: Text Analytics/ Machine Learning(R/Python)

. Routine Quality Analytics: Rmarkdown/SAS/Tableau/Excel

. Ad-hoc Analysis: Performing a series of ad-hoc analysis using R and Python to help draw business insights and support decision making upon requests from leadership team, such as business process mining, financial revenue forecasting(Gaussian Process, AWS DeepAR).

. Data Engineering/Management: Mapping current data status, developing data portal for easy access, collaboration and management(R shiny App).

. Rshiny Web App Develop/Deploy: MySQL/PostgreSQL, R /Markdown /Bookdown /Flexdashboard /dplyr/ htmlwidgets /Shiny Server

. Training: Develop tutorial for department training, including basic statistics, data manipulation /analysis /interpretation(Rbookdown book).

Clinical Statistical Programmer(SAS), Medical Affairs

BDM Consulting (June 2017 - October 2017), New Jersey, USA

. Exposed to extensive training about Clinical Trial SAS Programming, Clinical Trial Statistics, CDISC SDTM/ADaM;

. Generated SDTM/ADaM validation datasets according to mapping specification;

. Supported senior programmer within a study team, such as creating and validating QC datasets for TFL’s generation;

. Data manipulation upon requests from clients;

. Prepared clinical study summary reports(ppt) for statistician, reviewed and validated study poster draft.

Healthcare Equity Research Analyst, Research Institute

HWABAO Securities (February 2012 - July 2013), Shanghai, China

  • Note: this was generated with Rmarkdown