<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"><channel><title>Truong-Binh Duong | AI Researcher</title><description>Personal academic website of Truong-Binh Duong, an AI researcher working on multimodal learning, vision-language models, robust visual reasoning, and generalizable AI systems.</description><link>https://duongtruongbinh.github.io/</link><item><title>[Publication] ViInfographicVQA: A Benchmark for Single and Multi-Image Visual Question Answering on Vietnamese Infographics</title><link>https://duongtruongbinh.github.io/publications/viinfographicvqa/</link><guid isPermaLink="true">https://duongtruongbinh.github.io/publications/viinfographicvqa/</guid><description>A Vietnamese benchmark for evaluating single-image and multi-image reasoning on information-rich infographics.</description><pubDate>Thu, 01 Jan 2026 00:00:00 GMT</pubDate></item><item><title>[Publication] An Automated Pipeline for Constructing a Vietnamese VQA-NLE Dataset</title><link>https://duongtruongbinh.github.io/publications/vivqa-x/</link><guid isPermaLink="true">https://duongtruongbinh.github.io/publications/vivqa-x/</guid><description>An automated pipeline for constructing a Vietnamese visual question answering dataset with natural-language explanations.</description><pubDate>Sat, 01 Nov 2025 00:00:00 GMT</pubDate></item><item><title>[Publication] Describe Anything Model for Visual Question Answering on Text-Rich Images</title><link>https://duongtruongbinh.github.io/publications/dam-qa/</link><guid isPermaLink="true">https://duongtruongbinh.github.io/publications/dam-qa/</guid><description>An investigation of region-level descriptions from the Describe Anything Model for visual question answering on text-rich images.</description><pubDate>Wed, 01 Oct 2025 00:00:00 GMT</pubDate></item></channel></rss>