Haoxuan Li

Haoxuan Li

M.S. Computer Science

University of Southern California

Email: lihaoxua [at] usc [dot] edu

中文版点击这里



Hi, nice to meet you! I am a first-year Master student in computer science at University of Southern California. Previously, I received undergraduate degree in computer science and technology at Harbin Institute of Technology


My research interests generally lie in machine learning x {language, vision, finance, health, biotech}. I am interested in developing creative machine learning algorithms and applications that benefit from multi-modal information.


Feel free to email me if you want to discuss anything about research!

Education

Master of Science, Computer Science (expected in May 2025)

Electrical Engineering and Computer Science (Joint Program with HIT)

Bachelor of Engineering, Computer Science and Technology

Research Experience

UCI Computer Vision Lab

Oct 2022 - Jun 2023

Student Researcher

Mentor: Prof. Glenn Healey

  • Hyperspectral Image Data Processing and Analysis

    Analyzed hyperspectral data cubes through MATLAB, extracting spectra in SWIR and NVIR for building and vegetation regions to identify dominant bands

    Digitized imaging filters into CSVs and compared them in MATLAB against specific chemical emissions

    Digitized Mollweide projection plots of different pitch types for baseball players

HIT Massive Data Computing Lab

Mar 2021 - Jun 2023

Student Researcher

Mentor: Prof. Hongzhi Wang,     Dr. Xiaoou Ding

  • Research on Hybrid Error Data Cleaning Technology for Human In The Loop    

    Independently proposes and implements a hybrid human-in-the-loop framework for erroneous data cleaning

    Automatic Detection and Repair: The framework employs algorithms to automatically identify missing points and outliers in data, attempting auto-correction. When the algorithm's confidence in auto-correction does not reach a preset threshold, the system escalates the issue to human experts for intervention

    Human-Machine Interactive Repair: For data issues uncertain or beyond automatic correction, human experts are brought into the repair process. By recording their opinions and judgments, the system continuously adjusts the dynamic rational value range of the data, thereby enhancing the accuracy of anomaly detection and repair in subsequent processing

    Dynamic Data Quality Improvement: The efficacy of this cleaning framework has been validated through experiments assessing classification accuracy under different noise data ratios and regression accuracy following changes in the rational value environment

  • Research on Data Pricing Techniques Based on Data Quality Evaluation

    Contributed to constructing the relationship between data quality and pricing. Considered the weight of different indicators in pricing decisions, and the integration with current economic theories, to determine the appropriate pricing of data based on quantified evaluation metrics

  • Research on Time-Series Data Error Detection and Cleaning with Multi-Constraint

    Participated in the experiment of a new time-series data error detection and cleaning algorithm based on multiple constraints. Through comparative experiments with existing data cleaning methods, the superior performance of the newly proposed algorithm in addressing inaccuracies in time-series data was validated

    Replicated a mainstream algorithm "A symbolic representation of time series, with implications for streaming algorithms". The algorithm was broken down into viable steps, and by using or developing appropriate functions, theoretical models were converted into executable programs

Publications

Xiaoou Ding, Yingze Li, Chen Wang, Hongzhi Wang, Haoxuan Li

Journal of Software, 2023

Xiaoou Ding, Hongzhi Wang, Genglong Li, Haoxuan Li, Yingze Li, Yida Liu

Intelligent and Converged Networks, 2022