2nd-year PhD Student Visiting Researcher ๐ I love Engineering (mechanical, electromechanical, and algorithmic) as much as I love Hotpot ๐ฒ I practice and support Slow Science, just because I need time to understand the problems that I study. Publications | Patent | Projects | Google Scholar | Github | LinkedIn | Twitter | CV | CV of Failures | Services | WolowitzEmail: mingxuan.liu@unitn.it (primary) | mingxuanliu.miu@gmail.com |
I am a second-year PhD student in Deep Learning and Computer Vision at the University of Trento, under the joint supervision of Prof. Elisa RICCI and Prof. Zhun ZHONG. Currently, I am also a visiting researcher at NAVER LABS Europe exploring open-vocabulary object detection supervised by Gabriela CSURKA, Riccardo VOLPI, and Tyler L. HAYES.
My research is dedicated to improving machine Open-world Perception and Understanding through visual input. This endeavor sparked my keen interest in areas Multi-modal Learning (open-vocabulary object detection, vision-language), and Novel Knowledge Discovery and Reasoning (semi-/un-supervised learning, incremental learning, clustering, and multi-modal reasoning). Recently, my focus has particularly shifted towards exploring how the knowledge and reasoning capabilities of Large Language Models (LLMs) can help vision tasks in real-world scenarios.
Prior to commencing my doctoral studies, I cultivated a profound interest in Robotics, which led me to earn two Master's degrees in Autonomous Systems with distinction from KTH Royal Institute of Technology and the University of Trento. Before embarking on my academic journey, I gained valuable industry experience over three years as an Innovation Engineer at SIEMENS Smart Infrastructure Division. There, I played an active role in developing innovative automation solutions using Internet-of-Things (IoT) technologies.
In my free time, I enjoy bodybuilding, playing basketball, hiking, and cooking! Big fan of The Big Bang Theory and Eminem.
10/2024: I'm excited to share that I'll be joining the Zhou Lab at UCLA as a visiting student from Jan-2025 for six months, where I'll have the privilege of being advised by Prof. Bolei Zhou . Looking forward to diving into some cool research (w. more more dimensions? : ) at Zhou Labโand also hitting the Muscle Beach Venice Gym to lift some weights!
10/2024: Our new work on automatically discovering grouping criteria and visual data semantic substructures are preprinted. Check it out here!
09/2024: Happy to serve as a reviewer for ICLR 2025.
07/2024: I successfully got $5,000 funding (in credits) from OpenAI to support my research!
05/2024: One paper on incremental novel class discovery with large scale pre-trained model is accepted Oral paper at ICPR 2024..
05/2024: I am invited to serve as a reviewer for NeurIPS 2024.
05/2024: Filed my first US Patent: "A Method for Using Semantic Hierarchy Trees to Increase the Robustness of Open-vocabulary Object Detection Models"!
02/2024: One paper on open-vocabulary object detection with semantic hierarchy (work done with NAVER LABS Europe) is accepted as Highlight paper (2.8% acceptance rate) at CVPR 2024! Thanks to the team!
02/2024: I am invited to serve as a reviewer for ECCV 2024 and IJCV.
01/2024: One paper on discovering fine-grained semantic concepts with LLMs is accepted to ICLR 2024! See you in Vienna this May!
A Method for Using Semantic Hierarchy Trees to Increase the Robustness of Open-vocabulary Object Detection Models |
Organizing Unstructured Image Collections using Natural Language |
SHiNe: Semantic Hierarchy Nexus for Open-vocabulary Object Detection |
Democratizing Fine-grained Visual Recognition with Large Language Models |
Large-scale Pre-trained Models are Surprisingly Strong in Incremental Novel Class Discovery |
Class-incremental Novel Class Discovery * = co-first author |
Siemens Designo CC Building Management System Software and Practice Tutorial book about building automation management software (Siemens Desigo CC). Work carried out at SIEMENS. |
Siemens RWG Control Platform Advanced Course and Practice Tutorial book about IoT-based building automation control platform, including Programmable Logic Controller (PLC), Internet-of-Things (IoT), and cloud-based Software-as-a-Service (SaaS). Work carried out at SIEMENS. |
The V-SLAM Hurdler: A Faster V-SLAM System using Online Semantic Dynamic-and-Hardness-aware Approximation |
ORB-SLAM3 Deployment on Underwater Autonomous Vehicle (UAV) SAM |
Hot Steel Plate Tracking and Rotation Angle Detection |
Design, Kinematic and Dynamic Simulation of a Mini Cheetah Robotic Leg |
Virtual RGB-D Camera Unity Implementation |
ICLR: Reviewer'2025
NeurIPS: Reviewer'2024
CVPR: Reviewer'2024
ECCV: Reviewer'2024
IJCV: Reviewer'2024
This webpage template was inspired by Prof. Jia-Bin Huang's personal webpage.