-
SkeletonMAE: Graph-based Masked Autoencoder for Skeleton Sequence Pre-trainingICCV 2023
-
Cross-Modal Causal Relational Reasoning for Event-Level Visual Question AnsweringT-PAMI 2023
-
DreamEditor: Text-Driven 3D Scene Editing with Neural FieldsSIGGRAPH Asia 2023
-
Visual Causal Scene Refinement for Video Question AnsweringACM MM 2023
-
Parametric Linear Blend Skinning Model for Multiple-Shape 3D GarmentsArxiv 2024
-
TCGL: Temporal Contrastive Graph for Self-supervised Video Representation LearningT-IP 2022
-
Towards Controllable One-Shot Text-to-image Generation via Contrastive Prompt-Tuningarxiv 2022
-
Linguistically Routing Capsule Network for Out-of-distribution Visual Question AnsweringICCV 2021
-
Cross-Domain Facial Expression Recognition: A Unified Evaluation Benchmark and Adversarial Graph LearningT-PAMI 2021
-
Knowledge-Routed Visual Question Reasoning: Challenges for Deep Representation EmbeddingT-NNLS 2021
-
Cross-Modal Collaborative Representation Learning and a Large-Scale RGBT Benchmark for Crowd CountingCVPR 2021
-
Semantics-Aware Adaptive Knowledge Distillation for Sensor-to-Vision Action RecognitionT-IP 2021
-
Structured Attention Network for Referring Image SegmentationT-MM 2021
-
Relationship-Embedded Representation Learning for Grounding Referring ExpressionsT-PAMI 2021
-
SODA10M: A Large-Scale 2D Self/Semi-Supervised Object Detection Dataset for Autonomous DrivingNeurIPS 2021 Datasets and Benchmarks Track
-
Bidirectional Graph Reasoning Network for Panoptic SegmentationCVPR 2020
-
Graphonomy: Universal Image Parsing via Graph Reasoning and TransferT-PAMI 2020
-
Knowledge-Guided Multi-Label Few-Shot Learning for General Image RecognitionT-PAMI 2020
-
3D Human Pose Machines with Self-supervised Learning”. To appear in IEEE Transactions on Pattern Analysis and Machine IntelligenceTPAMI 2019
-
Crowd Counting with Deep Structured Scale Integration NetworkICCV 2019
-
Non-locally Enhanced Encoder-Decoder Network for Single Image De-raining” ACM International Conference on MultimediaACM MM 2018
-
Flow Guided Recurrent Neural Encoder for Video Salient Object DetectionCVPR 2018
-
Fine-Grained Representation Learning and Recognition by Exploiting Hierarchical Semantic EmbeddingACM MM 2018
-
Leaning to Segment Object Proposals via Recursive Neural NetworksTIP 2018
-
Hierarchical Scene Parsing by Weakly Supervised Learning with Image DescriptionsTPAMI 2018