PhD candidate
Hello. Thanks for visiting my website.
I'm Yunsoo Kim, a PhD student at University College London
under the supervision of Professor Honghan Wu.
Previously, I was a professional (senior research scientist) and NLP group leader at LG Chem,
and a co-founder of Torrey Pines Co., Ltd. (microbiome science cosmetics company).
I received my MSc in Bioinformatics and Theoretical Systems Biology from Imperial College London,
where I developed a method to analyze -omics time series using machine learning with Professor Tim Ebbels,
and I developed a method to infer networks from metagenomics abundance data with Dr. John Pinney and Dr. Virginia Fairclough.
I received my B.S. in Systems Biology with Honors (as well as concentration in Bioinformatics and Pre-Health Plan) from Case Western Reserve University,
where I made an assembly of Marama chloroplast from Next-gen sequencing reads with Professor Christopher Cullis.
My current work is focused on natural language processing (NLP) and its applications in health informatics.
My research interest is Foundation Model for medicine,
specifically in Multimodal LLMs for histopathology and radiology by collating clinical expert knowledge.
For each work, a brief description is provided as well as external links, if possible.
Multimodal LLM for biomedical domain.
I was an organiser of the tutorial at HealTac2024 and MICCAI2024.
I am currently organising the tutorial and workshop at MICCAI2025.
Thesis Proposal Presented at ACL2024 Student Research Workshop.
A novel approach to enhance human-computer interaction in chest X-ray analysis
with radiologists’ attention by incorporating eye gaze data alongside textual prompts for VLMs.
Presented at IJCAI2024 TAI4H workshop and Accepted at MICCAI2024 for poster presentation.
This study presents an automated pipeline, IHC-LLMiner,
for extracting IHC-tumour profiles from PubMed abstracts,
leveraging large language models. Our fine-tuned model outperformed GPT4O.
The work is under review.
We address a major gap in current medical QA benchmarks
which is the absence of comprehensive assessments of LLMs’ ability to generate nuanced medical explanations.
This work also proposes a new medical model, MedPhi-2,
which outperformed medical LLMs based on Llama2-70B in generating explanations.
The work is presented at ACL2024 BioNLP workshop as a poster.
Working on language (pre-trained) models in chemistry, materials science, and synthetic biology patents and literature.
Working on knowledge graphs construction using NLP and GNN.
Developing a system called CLUE (chemical langauge understanding expert),
which is an artificial reading system empowered by the fine-tuned models such as chemical entity recognition models.
Received 1st place in 2021 LG AI DX IDEAthon business applications sector.
Published in ACL2023 Industry Track. 5 patents for this research.
Early Allergen and Toxin assessment system 2018
Developed a neural network to predict protein toxicity with protein amino acid sequence as input.
precision 2% higher and prediction speed 22 times faster than SVM (which was the SOTA).
Co-founder 2015 - present & CEO 2016 - 2017
Torrey Pines is a cosmetics company specialized in hair care products.
Introduced microbiome science to hair care cosmetics marketing.
Successfully exported to Vietnam and Cambodia in 2017.
Launched a new brand called "Bota nouveau" in Cambodia (2019) and Korea (2020).
Led UCL team for MICCAI2024 tutorial
The largest tutorial at MICCAI2024
More than 100 attendees.
Received the Best Award in LG for AI research
Given by LG Group Chairman
Due to COVID19, LG Chem CEO
handed out the award.
Introduced microbiome science
to hair care cosmetics.
Actively used microbiome science for marketing.
I received FACT Graduate Scholarship which supports outstanding PhD students
in innovative, quantitative, and interdisciplinary research.
Received DX Frontier 1st place in applications sector
Given by LG Science Park
Was part of LG AI IDEAthon
Handed out the award in Metaverse.
Mentoring of 4 researchers at LG in 2020
One mentee received best award from the mentoring.
He continued working on it and received LG best practice in 2021.