We present ViVQA-X, the first Vietnamese Visual Question Answering dataset with natural language explanations. Our work introduces an automated pipeline leveraging multiple Large Language Models as evaluators to construct a high-quality Vietnamese VQA dataset with explanations.
@inproceedings{duong2026vivqax,title={An Automated Pipeline for Constructing a Vietnamese VQA-NLE Dataset},author={Duong, Truong-Binh and Tran, Hoang-Minh and Le-Nguyen, Binh-Nam and Duong, Dinh-Thang},booktitle={Proceedings of the Fifth International Conference on Intelligent Systems and Networks},series={Lecture Notes in Networks and Systems},pages={164--173},year={2025},publisher={Springer Nature Singapore},isbn={978-981-95-1746-6},doi={10.1007/978-981-95-1746-6_18},github={duongtruongbinh/ViVQA-X},}
ICCV Workshop
Describe Anything Model for Visual Question Answering on Text-rich Images
Yen-Linh Vu*, Dinh-Thang Duong*, Truong-Binh Duong, and 1 more author
We propose the Describe Anything Model (DAM-QA) for Visual Question Answering on text-rich images. Our approach demonstrates strong performance across six benchmark datasets through comprehensive evaluation of multiple Vision-Language Models.
@article{vu2025dam,title={Describe Anything Model for Visual Question Answering on Text-rich Images},author={Vu, Yen-Linh and Duong, Dinh-Thang and Duong, Truong-Binh and others},journal={ICCV Workshop},year={2025},publisher={IEEE},github={Linvyl/DAM-QA},note={*Equal contribution}}
2024
HackHCMC
Heineken Image Analysis Tool: Multi-Model AI Pipeline for Brand Compliance
Engineered a multi-model AI pipeline integrating YOLOv10, Owlv2, PaddleOCR, and CLIP for automated brand compliance and safety analysis from images. Deployed using FastAPI backend for robust model inference.
2023
Personal
Math Solving Assistant: AI-Powered Mathematical Problem Solver
Developed a chatbot website to solve math problems using Gemini API, integrating OCR for image input. Fine-tuned Gemini on MetaMathQA-40K with Vertex AI for improved mathematical reasoning.
2022
Course
Manga Popularity Prediction: End-to-End Data Science Pipeline
Conducted comprehensive data science pipeline including web scraping, exploratory analysis, and predictive modeling. Implemented multiple regression models (Linear Regression, Random Forest, XGBoost) with Bayesian optimization for hyperparameter tuning.