Takip et
Tongtian Yue
Tongtian Yue
ia.ac.cn üzerinde doğrulanmış e-posta adresine sahip
Başlık
Alıntı yapanlar
Alıntı yapanlar
Yıl
Chatbridge: Bridging modalities with large language model as a language catalyst
Z Zhao, L Guo, T Yue, S Chen, S Shao, X Zhu, Z Yuan, J Liu
arXiv preprint arXiv:2305.16103, 2023
632023
Needle in a video haystack: A scalable synthetic framework for benchmarking video mllms
Z Zhao, H Lu, Y Huo, Y Du, T Yue, L Guo, B Wang, W Chen, J Liu
arXiv e-prints, arXiv: 2406.09367, 2024
172024
Unveiling Parts Beyond Objects: Towards Finer-Granularity Referring Expression Segmentation
W Wang, T Yue, Y Zhang, L Guo, X He, X Wang, J Liu
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024
172024
SC-Tune: Unleashing Self-Consistent Referential Comprehension in Large Vision Language Models
T Yue, J Cheng, L Guo, X Dai, Z Zhao, X He, G Xiong, Y Lv, J Liu
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024
152024
EEGPT: Unleashing the Potential of EEG Generalist Foundation Model by Autoregressive Pre-training
T Yue, S Xue, X Gao, Y Tang, L Guo, J Jiang, J Liu
arXiv preprint arXiv:2410.19779, 2024
62024
OneDiff: A Generalist Model for Image Difference Captioning
E Hu, L Guo, T Yue, Z Zhao, S Xue, J Liu
Proceedings of the Asian Conference on Computer Vision, 2439-2455, 2024
32024
Chatsearch: A dataset and a generative retrieval model for general conversational image retrieval
Z Zhao, L Guo, T Yue, E Hu, S Shao, Z Yuan, H Huang, J Liu
Pattern Recognition, 111696, 2025
22025
Towards Unified Referring Expression Segmentation Across Omni-Level Visual Target Granularities
J Liu, W Wang, Y Zhang, Y Tang, X He, L Guo, T Yue, X Wang
arXiv preprint arXiv:2504.01954, 2025
22025
Ada-K Routing: Boosting the Efficiency of MoE-based LLMs
T Yue, L Guo, J Cheng, X Gao, H Huang, J Liu
The Thirteenth International Conference on Learning Representations, 2024
22024
Needle In A Video Haystack: A Scalable Synthetic Evaluator for Video MLLMs
Z Zhao, H Lu, Y Huo, Y Du, T Yue, L Guo, B Wang, W Chen, J Liu
arXiv preprint arXiv:2406.09367, 2024
22024
Collaborative Training of Tiny-Large Vision Language Models
S Lu, L Guo, W Wang, Z Zhao, T Yue, J Liu, S Liu
Proceedings of the 32nd ACM International Conference on Multimedia, 4928-4937, 2024
12024
LaVi: Efficient Large Vision-Language Models via Internal Feature Modulation
T Yue, L Guo, Y Tang, Z Zhao, X Zhu, H Huang, J Liu
arXiv preprint arXiv:2506.16691, 2025
2025
Prefix Grouper: Efficient GRPO Training through Shared-Prefix Forward
Z Liu, T Yue, Y Tang, L Guo, J Cai, Q Liu, X Chen, J Liu
arXiv preprint arXiv:2506.05433, 2025
2025
Efficient Motion-Aware Video MLLM
Z Zhao, Y Huo, T Yue, L Guo, H Lu, B Wang, W Chen, J Liu
Proceedings of the Computer Vision and Pattern Recognition Conference, 24159 …, 2025
2025
Sistem, işlemi şu anda gerçekleştiremiyor. Daha sonra yeniden deneyin.
Makaleler 1–14
OSZAR »