Juhyoung Lee

Data & ML Engineer

MS in Data Analytics Engineering @ Northeastern · GPA 4.0

Academic Projects

Data

LLM-based Signal Extraction from Recognition Messages

Jan 2026

A research project in collaboration with Workhuman. An LLM pipeline extracts promotion-related signals from employee award messages; the results are validated with bootstrap confidence intervals, effect sizes, and a 4-model comparison.

  • Extracted promotion-related signals from employee award messages using an LLM pipeline (Workhuman collaboration)
  • Validated reliability with bootstrap confidence intervals, effect sizes, and inter-model validation across 4 models
LLM · NLP · Statistical Validation · Inter-model Validation · HR Analytics
ML

VQA with BLIP-2 + Phi-1.5 (Sub-3B Constraint)

Aug 2025

Customized BLIP-2 architecture by replacing OPT with Phi-1.5 to meet a sub-3B parameter budget. Fine-tuned Q-Former with multi-stage training. Ranked 43rd / 242 teams (top 18%).

  • Customized BLIP-2 with the Phi-1.5 LLM under a sub-3B parameter budget and a pre-2024 model constraint
  • Multi-stage fine-tuning (captioning → VQA → reasoning → closed-ended); 0.7665 weighted accuracy, top 18%
Multimodal · VQA · BLIP-2 · Phi-1.5 · Q-Former · Vision-Language · Fine-tuning
ML

Fine-grained Car Image Classification (396 Classes)

Mar 2025

Multi-class classification of used car images into 396 make/model/year classes. Combines YOLO-based background removal, ConvNeXt-Base fine-tuning, and test-time augmentation (TTA) with a soft-voting ensemble. Ranked 134th / 748 teams (top 17.9%).

  • Built fine-grained image classifier for 396 car make/model/year classes (top 17.9%, Hecto AI Challenge)
  • Pipeline: YOLO background removal, ConvNeXt-Base fine-tuning, and TTA with a soft-voting ensemble
Computer Vision · Image Classification · Fine-grained Classification · ConvNeXt · Fine-tuning · YOLO · TTA · Ensemble
Data

Pet Hotel — Relational Database Design with MySQL

Dec 2024

Designed and implemented a relational database for a pet hotel business using MySQL. Covers ER modeling, schema normalization, and complex SQL queries (JOIN, subquery, EXISTS).

  • Designed and implemented a relational database for a pet hotel using MySQL
  • ER modeling, schema design, and complex SQL queries (JOIN, subquery, EXISTS)
SQL · Database Design · MySQL · ER Modeling · Relational Model · Schema Design
Work Experience

Northeastern University

Graduate Teaching Assistant · Part-time · Boston, MA

Sep 2025 – Apr 2026

Teaching Assistant for Data Management for Analytics (DMA).

Teaching · SQL · Data Management

Hexlant, Inc.

Software Developer · Full-time · Seoul, South Korea

Nov 2021 – Nov 2022

Developed NFT and DeFi projects, managed project databases, and implemented smart contracts.

Solidity · Smart Contracts · NFT · DeFi · SQL

Wigo, Inc.

Software Developer · Intern · Seoul, South Korea

Jan 2020 – Feb 2020
  • Collected and processed unstructured text data for NLP experiments (tokenization & normalization)
  • Evaluated chatbot model performance and implemented changes to improve accuracy
NLP · Python · Chatbot · Data Processing

Education

Northeastern University

MS in Data Analytics Engineering · GPA 4.0 · Boston, MA

May 2026

Data Mining · Data Management · Generative AI Systems

Kookmin University

BS in Computer Engineering · Seoul, South Korea

Aug 2021

Machine Learning · Database · NLP