Avatar of Truong-Binh Duong

Truong-Binh Duong

AI VIETNAM

AI researcher working on multimodal learning, vision-language models, and robust visual reasoning

  • About
  • Publications
  • Research
  • CV

Publications

Peer-reviewed papers and preprints in multimodal learning, visual question answering, and vision-language models.

ViInfographicVQA: A Benchmark for Single and Multi-Image Visual Question Answering on Vietnamese Infographics
January 2026 Tue-Thu Van-Dinh*, Hoang-Duy Tran*, Truong-Binh Duong, et al. AAAI Workshop on AI for Scientific Research, 2026
#Vision-Language Models #Visual Question Answering #Multi-Image Reasoning #Vietnamese AI

A Vietnamese benchmark for evaluating single-image and multi-image reasoning on information-rich infographics.

View
An Automated Pipeline for Constructing a Vietnamese VQA-NLE Dataset
November 2025 Truong-Binh Duong, Hoang-Minh Tran, Binh-Nam Le-Nguyen, and Dinh-Thang Duong Proceedings of ICISN 2025, Springer LNNS, Vol. 1596
#Visual Question Answering #Multimodal Learning #Dataset Construction #Vietnamese AI

An automated pipeline for constructing a Vietnamese visual question answering dataset with natural-language explanations.

View
Describe Anything Model for Visual Question Answering on Text-Rich Images
October 2025 Yen-Linh Vu*, Dinh-Thang Duong*, Truong-Binh Duong, et al. VisionDocs Workshop at ICCV 2025
#Vision-Language Models #Text-Rich Images #Visual Question Answering #Document Intelligence

An investigation of region-level descriptions from the Describe Anything Model for visual question answering on text-rich images.

View
© 2026 Truong-Binh Duong.