ML

Publications

Highlighted

End-to-End Neuro-Symbolic Reinforcement Learning with Textual Explanations
Lirui Luo, Guoxi Zhang, Hongming Xu, Yaodong Yang, Cong Fang, Qing Li
International Conference on Machine Learning (ICML)  ·  2024

All

2024

Bongard-OpenWorld: Few-Shot Reasoning for Free-form Visual Concepts in the Real World
Rujie Wu, Xiaojian Ma, Zhenliang Zhang, Wei Wang, Qing Li, Song-Chun Zhu, Yizhou Wang
International Conference on Learning Representations (ICLR)  ·  2024
CLOVA: A Closed-Loop Visual Assistant with Tool Usage and Update
Zhi Gao, Yuntao Du, Xintong Zhang, Xiaojian Ma, Wenjuan Han, Song-Chun Zhu, Qing Li
The IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)  ·  2024
VideoAgent: A Memory-augmented Multimodal Agent for Video Understanding
Yue Fan, Xiaojian Ma, Rujie Wu, Yuntao Du, Jiaqi Li, Zhi Gao, Qing Li
European Conference on Computer Vision (ECCV)  ·  2024
MMICL: Empowering Vision-language Model with Multi-Modal In-Context Learning
Haozhe Zhao, Zefan Cai, Shuzheng Si, Xiaojian Ma, Kaikai An, Liang Chen, Zixuan Liu, Sheng Wang, Wenjuan Han, Baobao Chang
International Conference on Learning Representations (ICLR)  ·  2024
An Embodied Generalist Agent in 3D World
Jiangyong Huang, Silong Yong, Xiaojian Ma, Xiongkun Linghu, Puhao Li, Yan Wang, Qing Li, Song-Chun Zhu, Baoxiong Jia, Siyuan Huang
International Conference on Machine Learning (ICML)  ·  2024
End-to-End Neuro-Symbolic Reinforcement Learning with Textual Explanations
Lirui Luo, Guoxi Zhang, Hongming Xu, Yaodong Yang, Cong Fang, Qing Li
International Conference on Machine Learning (ICML)  ·  2024
SceneVerse: Scaling 3D Vision-Language Learning for Grounded Scene Understanding
Baoxiong Jia, Yixin Chen, Huangyue Yu, Yan Wang, Xuesong Niu, Tengyu Liu, Qing Li, Siyuan Huang
European Conference on Computer Vision (ECCV)  ·  2024
Neural-Symbolic Recursive Machine for Systematic Generalization
Qing Li, Yixin Zhu, Yitao Liang, Ying Nian Wu, Song-Chun Zhu, Siyuan Huang
International Conference on Learning Representations (ICLR)  ·  2024
JARVIS-1: Open-World Multi-task Agents with Memory-Augmented Multimodal Language Models
Zihao Wang, Shaofei Cai, Anji Liu, Yonggang Jin, Jinbing Hou, …, Zhaofeng He, Zilong Zheng, Yaodong Yang, Xiaojian Ma, Yitao Liang
Transactions on Pattern Analysis and Machine Intelligence (TPAMI)  ·  2024

2023

3D-VisTA: Pre-trained Transformer for 3D Vision and Text Alignment
Ziyu Zhu, Xiaojian Ma, Yixin Chen, Zhidong Deng, Siyuan Huang, Qing Li
International Conference on Computer Vision (ICCV)  ·  2023
A Minimalist Dataset for Systematic Generalization of Perception, Syntax, and Semantics
Qing Li, Siyuan Huang, Yining Hong, Yixin Zhu, Ying Nian Wu, Song-Chun Zhu
International Conference on Learning Representations (ICLR)  ·  2023
Exploring Data Geometry for Continual Learning
Zhi Gao, Chen Xu, Feng Li, Yunde Jia, Mehrtash Harandi, Yuwei Wu
The IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)  ·  2023
SQA3D: Situated Question Answering in 3D Scenes
Xiaojian Ma, Silong Yong, Zilong Zheng, Qing Li, Yitao Liang, Song-Chun Zhu, Siyuan Huang
International Conference on Learning Representations (ICLR)  ·  2023
Learning non-Markovian Decision-Making from State-only Sequences
Aoyang Qin, Feng Gao, Qing Li, Song-Chun Zhu, Sirui Xie
Advances in Neural Information Processing Systems (NeurIPS)  ·  2023
Meta-causal Learning for Single Domain Generalization
Jin Chen, Zhi Gao, Xinxiao Wu, Jiebo Luo
The IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)  ·  2023
Learning to Optimize on Riemannian Manifolds
Zhi Gao, Yuwei Wu, Xiaomeng Fan, Mehrtash Harandi, Yunde Jia
Transactions on Pattern Analysis and Machine Intelligence (TPAMI)  ·  2023
Curvature-Adaptive Meta-Learning for Fast Adaptation to Manifold Data
Zhi Gao, Yuwei Wu, Mehrtash Harandi, Yunde Jia
Transactions on Pattern Analysis and Machine Intelligence (TPAMI)  ·  2023
Describe, Explain, Plan and Select: Interactive Planning with Large Language Models Enables Open-World Multi-Task Agents
Zihao Wang, Shaofei Cai, Guanzhou Chen, Anji Liu, Xiaojian Ma, Yitao Liang
Advances in Neural Information Processing Systems (NeurIPS)  ·  2023