Learning human-object interactions by graph parsing neural networks S Qi*, W Wang*, B Jia, J Shen, SC Zhu Proceedings of the European conference on computer vision (ECCV), 401-417, 2018 | 619 | 2018 |
Raven: A dataset for relational and analogical visual reasoning C Zhang*, F Gao*, B Jia, Y Zhu, SC Zhu Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2019 | 270 | 2019 |
Diffusion-based Generation, Optimization, and Planning in 3D Scenes S Huang*, Z Wang*, P Li, B Jia, T Liu, Y Zhu, W Liang, SC Zhu arXiv preprint arXiv:2301.06015, 2023 | 121 | 2023 |
Learning perceptual inference by contrasting C Zhang*, B Jia*, F Gao, Y Zhu, H Lu, SC Zhu Advances in neural information processing systems 32, 2019 | 112 | 2019 |
Abstract spatial-temporal reasoning via probabilistic abduction and execution C Zhang*, B Jia*, SC Zhu, Y Zhu Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2021 | 65 | 2021 |
Latent diffusion energy-based model for interpretable text modeling P Yu, S Xie, X Ma, B Jia, B Pang, R Gao, Y Zhu, SC Zhu, YN Wu Proceedings of the 39th International Conference on Machine Learning, PMLR …, 2022 | 64 | 2022 |
Acre: Abstract causal reasoning beyond covariation C Zhang, B Jia, M Edmonds, SC Zhu, Y Zhu Proceedings of the ieee/cvf conference on computer vision and pattern …, 2021 | 41 | 2021 |
LEMMA: A Multi-view Dataset for LEarning Multi-agent Multi-task Activities B Jia, Y Chen, S Huang, Y Zhu, S Zhu European Conference on Computer Vision, 767-786, 2020 | 40 | 2020 |
Generalized earley parser: Bridging symbolic grammars and sequence data for future prediction S Qi, B Jia, SC Zhu International conference on machine learning, 4171-4179, 2018 | 36 | 2018 |
A generalized earley parser for human activity parsing and prediction S Qi, B Jia, S Huang, P Wei, SC Zhu IEEE Transactions on Pattern Analysis and Machine Intelligence 43 (8), 2538-2554, 2020 | 35 | 2020 |
Learning algebraic representation for systematic generalization in abstract reasoning C Zhang*, S Xie*, B Jia*, YN Wu, SC Zhu, Y Zhu Computer Vision–ECCV 2022: 17th European Conference, Tel Aviv, Israel …, 2022 | 34 | 2022 |
Mining user reviews for mobile app comparisons Y Li, B Jia, Y Guo, X Chen Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous …, 2017 | 34 | 2017 |
Improving Object-centric Learning with Query Optimization B Jia*, Y Liu*, S Huang The Eleventh International Conference on Learning Representations, 2023 | 32* | 2023 |
Egotaskqa: Understanding human tasks in egocentric videos B Jia, T Lei, SC Zhu, S Huang Thirty-sixth Conference on Neural Information Processing Systems Datasets …, 2022 | 29 | 2022 |
An Embodied Generalist Agent in 3D World J Huang*, S Yong*, X Ma*, X Linghu*, P Li, Y Wang, Q Li, SC Zhu, B Jia, ... arXiv preprint arXiv:2311.12871, 2023 | 24 | 2023 |
ARNOLD: A Benchmark for Language-Grounded Task Learning With Continuous States in Realistic 3D Scenes R Gong*, J Huang*, Y Zhao, H Geng, X Gao, Q Wu, W Ai, Z Zhou, ... arXiv preprint arXiv:2304.04321, 2023 | 13* | 2023 |
Sceneverse: Scaling 3d vision-language learning for grounded scene understanding B Jia, Y Chen, H Yu, Y Wang, X Niu, T Liu, Q Li, S Huang arXiv preprint arXiv:2401.09340, 2024 | 10 | 2024 |
PhyScene: Physically Interactable 3D Scene Synthesis for Embodied AI Y Yang*, B Jia*, P Zhi, S Huang arXiv preprint arXiv:2404.09465, 2024 | 4 | 2024 |
Closed-loop open-vocabulary mobile manipulation with gpt-4v P Zhi, Z Zhang, M Han, Z Zhang, Z Li, Z Jiao, B Jia, S Huang arXiv preprint arXiv:2404.10220, 2024 | 3 | 2024 |
Move as You Say Interact as You Can: Language-guided Human Motion Generation with Scene Affordance Z Wang, Y Chen, B Jia, P Li, J Zhang, J Zhang, T Liu, Y Zhu, W Liang, ... Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024 | 3 | 2024 |