I am a doctoral student in the Massive Data & Algorithms (MiDAS) research group at BU,
advised by Dr. Evimaria Terzi.
My research focuses on balancing submodular utility and cost tradeoffs in combinatorial optimization problems,
with applications to team formation, influence maximization, and recommender systems.
I am also interested in social computing, and have studied algorithmic personalization in social media feeds.
Previously I was at IBM in Cambridge, MA, building end-to-end machine learning solutions in production contexts.
I enjoy playing tennis and soccer and spending time outdoors running, skiing, and hiking. I also like challenging myself with chess, poker, and other strategy games. I am an enthusiastic home cook and barista, and enjoy exploring the greater Boston area in search of good food and drink!
I am on the job market for industry roles in applied research, ML engineering, and data science starting Summer 2026.
We study submodular optimization problems where the goal is to maximize utility while minimizing cost, and move beyond traditional approaches that return only a single solution with a fixed tradeoff. We develop efficient algorithms to compute approximate Pareto frontiers, providing representative solutions that expose the full spectrum of achievable utility–cost tradeoffs in applications such as recommender systems, influence maximization, and team formation.
We introduce a unified QUBO-based formulation for team formation that jointly optimizes skill coverage and expert cost across multiple cost definitions. Our approach achieves performance comparable to established methods and enables transfer learning via graph neural networks that learn reusable expert and skill representations
We study team formation problems that balance maximizing skill coverage with minimizing expert workload, and extend the setting to coordination graphs that capture collaboration constraints between experts. We develop scalable approximation algorithms with provable guarantees that efficiently optimize coverage, workload balance, and coordination on real-world datasets.
We develop a temporal graph framework to detect and quantify personalization in social media feeds by distinguishing between exploration and exploitation in recommendation timelines. Using real and baseline TikTok datasets, we show how user interactions such as viewing, liking, and following shape personalization, enabling transparent and explainable auditing of recommendation systems.
We introduce the Balanced Coverage problem, which assigns experts to tasks to maximize skill coverage while simultaneously minimizing imbalance in workload across experts. Despite the problem’s NP-hardness, we develop a scalable approximation algorithm with provable guarantees that efficiently balances coverage and workload on real-world datasets.
Distributed Apache Flink pipeline for streaming EMA-based trend detection and breakout identification. Uses custom window operators and parallel event generation for scalable throughput. Benchmarked on DEBS 2022 Grand Challenge metrics with low latency and high batch throughput.
Comparative analysis of COVID-19 spread and mobility across US states, controlled by clustering on population and density. Characterizes travel behavior using daily trip metrics and explores temperature effects on safety practices. Highlights political and behavioral contrasts in early-pandemic mobility patterns.
Real-time lyric alignment system for live music, supported by a Xerox research fellowship with the Audio Information Research Lab. Built a multi-threaded audio pipeline and aligned live streams using chroma features and online dynamic time warping. Demonstrates low-latency synchronization between live audio and pre-encoded lyrics.