I am a M.S. Computer Science student @Northeastern University. My expected graduation date is between Aug to Dec, 2025.
I am passionate about AI system. I've explored LLM inference acceleration, tensor parallelism, and model fine-tune.
Here's a handful selection of my recent experience:
-
Designed a housing inventory prediction model during a machine learning engineer intern @Berkshire Hathaway HomeServices.
-
Explored CPU-Only LLM Acceleration with model distillation.
-
Built a framework to asynchronously schedule tasks during deep learning tensor computation with 4x performance improvement in parallel request query.
I have also contributed to open-source projects such as ZenML.