Tsung-Yu Lin

Research Scientist
Meta AI
tsungyulin at meta.com

me

I am a Research Scientist at Meta, where my work centers on image and video representation learning as well as vision-language modeling. Prior to joining Meta, I completed my doctoral studies in the Computer Vision Lab at the University of Massachusetts Amherst under the supervision of Prof. Subhransu Maji.

My research aims to advance the development of AI systems capable of interpreting and interacting with the physical world to support people in their everyday lives. I am particularly interested in semantic visual representations learning, fine-grained visual understanding, geometric modeling, temporal reasoning, and building systems that robustly generalize to real-world environments.

I am fortunate to have collaborated with numerous distinguished researchers and interns in the fields of machine learning and artificial intelligence. Additional information can be found in my CV and Google Scholar profile.



Home        

Research        

Publication        

CV


Projects
Second-order Democratic Aggregation
Dark Ecology: tracking bird migration with computer vision and deep learning techniques
Visualizing and Understanding Deep Texture Representations
Bilinear CNN for Fine-Grained Classification
People localization in a camera network
pdf
Shape Prior Non-Rigid SfM for deformable Surfaces
pdf
Multi-camera and multi-target surveillance tracking 3D Face Model Deformation


Publications

Learning to Localize Objects Improves Spatial Reasoning in Visual-LLMs

CVPR 2024    [pdf]

Open Vocabulary Semantic Segmentation with Patch Aligned Contrastive Learning

CVPR 2023    (Highlight, Acceptance rate: ~10% of accepted papers)    [pdf]

Few-shot fast-adaptive anomaly detection

NeurIPS 2022    [pdf]

Raising the Bar on the Evaluation of Out-of-Distribution Detection

Arxiv [paper]

MistNet: Measuring Historical Bird Migration in the US Using Archived Weather Radar Data and Convolutional Neural Networks

Methods in Ecology and Evolution, August 2019    [project]
2019 Robert May Early Career Research Prize Shortlist [The Shortlist]

Second-order Democratic Aggregation

ECCV 2018    [project] [pdf] [arXiv]

Improved Bilinear Pooling with CNNs

BMVC 2017 (Oral)    [project] [pdf] [arXiv]

Bilinear Convolutional Neural Networks for Fine-grained Visual Recognition

PAMI 2017   [project] [pdf]

Visualizing and Understanding Deep Texture Representations

CVPR 2016    [project] [pdf] [arXiv]

One-to-many face recognition with Bilinear CNNs

WACV 2016   [pdf]

Implicit Sparse Code Hashing

arXiv:1512.00130 , 2015    [arXiv]

Bilinear CNN Models for Fine-grained Visual Recognition

ICCV 2015  (Oral, Acceptance rate: 3.3%)    [project] [pdf] [pdf-supp] [arXiv] [slides] [poster] [bibtex]

Efficient binary codes for extremely high-dimensional data

ICIP 2014  [pdf]

People localization in a camera network combining background subtraction and scene-aware human detection

MMM 2011  [pdf]