Wuao Liu

Wuao Liu (刘武傲)

I am a Computer Science Ph.D. student working in the Computer Vision Lab at UMass Amherst, advised by Prof. Grant Van Horn. Previously, I received my M.S. in Robotics from the University of Michigan, Ann Arbor, and my B.Eng. in Robotics from Zhejiang University.

My research leverages computer vision and multimodal machine learning for applications in biodiversity monitoring and conservation efforts. I currently focus on visual and auditory recognition of fine-grained categories, as well as wildlife species range estimation.

I’m actively looking for research collaborations. Please feel free to send me an email!

News

[06/2026]: Our work on Masked Autoencoders with Limited Data has been accepted as an oral talk at FGVC13@CVPR.
[03/2026]: I worked at Microsoft LinkedIn as a GenAI Research Intern.
[11/2025]: I served on the program committee for NECV 25 (UMass Amherst).
[05/2025]: I joined Honda Research Institute as a Research Scientist Intern.
[09/2024]: I started at UMass Amherst as a CS PhD student.
[06/2021]: I graduated with an honors degree from Zhejiang University.


Publications

Masked Autoencoders with Limited Data: Does It Work?
A Fine-Grained Bioacoustics Case Study

Wuao Liu, Mustafa Chasmai, Subhransu Maji, Grant Van Horn

Workshop on Fine-Grained Visual Categorization @ CVPR 2026 Oral Talk (1/35)

A systematic study of MAE pretraining for species classification on iNatSounds, analyzing the impacts of pretraining data scale, domain specificity, data curation, and transfer strategies.

PDF arXiv

Bioacoustic Geolocation: Species Sounds as Geographic Signals

Mustafa Chasmai, Wuao Liu, Subhransu Maji, Grant Van Horn

ICML 2026

Using a vision-inspired approach, we tackle the novel challenge of predicting the geographic location of audio recordings by leveraging species sounds and multimodal retrieval techniques.

Project Page PDF arXiv Code

RealBirdID: Benchmarking Bird Species Identification in the Era of MLLMs

Logan Lawrence, Mustafa Chasmai, Rangel Daroya, Wuao Liu, Seoyun Jeong, Aaron Sun, Max Hamilton, Fabien Delattre, Oindrila Saha, Subhransu Maji, Grant Van Horn

CVPR 2026

We introduce RealBirdID, a benchmark for fine-grained bird identification where models must either predict a species or abstain with an evidence-based rationale (for example, the need for a vocalization, a low-quality image, or occlusion).

PDF arXiv Code

Can Large Language Models Reason About Goal-Oriented Tasks?

Filippos Bellos, Yayuan Li, Wuao Liu, Jason J. Corso

Workshop on the Scaling Behavior of Large Language Models @ ACL 2024

We study how well LLMs can complete a sequence of steps to achieve a certain goal, such as making a sandwich or repairing a bicycle tire.

Project Page PDF Video

Experience

GenAI Research Intern, at LinkedIn
May 2026 - Aug 2026

Mentor: Yang Hu · Managers: Xiaoqing Wang, Jerry Shen

Research Scientist Intern, at Honda Research Institute
May 2025 - Aug 2025

Mentors: Nirav Savaliya, Goro Yeh
