Graduate Research Assistant
Department of Health Policy, University of Pittsburgh School of Public Health, Pittsburgh, PA
Provided statistical support for projects evaluating the performance, access, capacity, and diversity of the
Behavioral Health Network in Allegheny County for Medicaid enrollees, including web scraping, trend identification,
and data visualization.
Applied the capture-recapture method to estimate the prevalence of opioid overdose in Allegheny County using negative binomial regression, and used the bootstrap to estimate confidence intervals for the combined estimates.
August 2021 - January 2024
Research Graduate Fellow
Biocomplexity Institute and Initiative, University of Virginia, Arlington, VA
Led two projects in support of the department of Social and Decision Analytics at the University of Virginia. Collaborated with research scientists, and supervised and instructed undergraduate interns in the data science framework. Built R Shiny apps and presented them to stakeholders.
In the R&D Text Corpora Filtering and Data Mining project, performed natural language processing, including Sentence-BERT embeddings, to retrieve articles about artificial intelligence (AI) from Federal RePORTER abstracts. Performed non-negative matrix factorization topic modeling on AI abstracts and identified emerging topics.
In the Defining and Measuring the Universe of Open Source Software Innovation project, scraped GitHub repository information and classified repositories into software types using term matching and sentence embeddings, which
allowed the National Center for Science and Engineering Statistics to understand how different types of software are used
within and across economic sectors.
June 2021 - August 2021
Research Assistant
Department of Computational Oncology, Memorial Sloan Kettering Cancer Center, New York City
Applied machine learning to investigate metabolomics data from various cancers and predict unidentified metabolites. Implemented Lasso for its dimensionality reduction, strong model performance, and ease of interpretation. Studied the underlying structure of the metabolites via subpathways. Applied Bayesian Lasso and achieved more stable parameter estimates.
June 2019 - August 2019