I am an AI researcher and teaching assistant at AI VIETNAM. I received my B.Sc. in Information Technology from the University of Science, Vietnam National University Ho Chi Minh City, graduating with a GPA of 3.83/4.0 and earning Dean’s List recognition in five semesters.

My research focuses on building multimodal AI systems that can reason reliably across visual and linguistic information. I am particularly interested in how to make these models more robust, how to evaluate them, and how to help them generalize beyond standard in-domain benchmarks.

Research Interests

My current research interests include:

  • Multimodal learning and vision-language models
  • Robust visual question answering and visual grounding
  • Embodied AI and vision-language-action models

Current Work

At AI VIETNAM, I conduct research on vision-language models and LLM reasoning while contributing to the development of teaching materials for artificial intelligence courses.

My recent work has explored Vietnamese multimodal datasets, text-rich and multi-image visual question answering, automated data construction pipelines, and counterfactual learning for robust visual reasoning.

Research Direction

My long-term goal is to develop AI systems that combine visual perception, language reasoning, and action while remaining robust under distribution shifts and real-world uncertainty.