Augmenting Multimodal LLMs with Self-Reflective Tokens for Knowledge-based Visual Question Answering
Please cite this work with the following BibTeX: @inproceedings{cocchi2024augmenting, title={{Augmenting Multimodal LLMs with Self-Reflective Tokens for Knowledge-based Visual Question Answering}}, ...
The desktop application provides the best experience with zero environment setup required. Simply download and run.
Some results have been hidden because they may be inaccessible to you
Show inaccessible results