Self-Training on Image Comprehension (STIC): A Novel Self-Training Approach Designed to Enhance the Image Comprehension Capabilities of Large Vision Language Models (LVLMs)
Large language models (LLMs) have gained significant attention due to their advanced capabilities in processing and generating text. However, the…