Describe Anything Model for Visual Question Answering on Text-Rich Images
An investigation of region-level descriptions from the Describe Anything Model for visual question answering on text-rich images.