Fine-tune LLaVA model on VQA pairs using NIH X-ray images

The goal is very simple: fine-tune a vision-language model (LLaVA or CogVLM2) on a meaningful (but as small as feasible) subset of the NIH ChestX-ray14 dataset. The aim is to build a system capable of Visual Question Answering (VQA) tailored to medical diagnostics… (Budget: ₹600 – ₹1500 INR, Jobs: AI Chatbot Development, AI Development, AI Model Development, AI Research, AI Text-to-text, Computer Vision, Data Processing, Data Science, Machine Learning (ML), Natural Language Processing)
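
For context, a minimal sketch of what such a fine-tune could look like with Hugging Face transformers and PEFT is below. It assumes the llava-hf/llava-1.5-7b-hf checkpoint, LoRA adapters on the language model's attention projections, and a hypothetical list of (image, question, answer) triples derived from ChestX-ray14 finding labels; the file path, prompt template, and single-example training loop are illustrative, not the poster's actual pipeline.

```python
import torch
from PIL import Image
from peft import LoraConfig, get_peft_model
from transformers import AutoProcessor, LlavaForConditionalGeneration

MODEL_ID = "llava-hf/llava-1.5-7b-hf"  # assumed checkpoint; CogVLM2 needs its own loader

processor = AutoProcessor.from_pretrained(MODEL_ID)
model = LlavaForConditionalGeneration.from_pretrained(
    MODEL_ID,
    torch_dtype=torch.bfloat16,  # assumes an Ampere-or-newer GPU
    device_map="auto",
)

# LoRA keeps the trainable-parameter count small, which suits a small subset and budget.
lora_cfg = LoraConfig(
    r=8,
    lora_alpha=16,
    lora_dropout=0.05,
    target_modules=["q_proj", "v_proj"],  # attention projections of the language model
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora_cfg)
model.train()

# Hypothetical VQA pairs; in practice they would be generated from the
# ChestX-ray14 finding labels (e.g. "Cardiomegaly" -> a yes/no question).
vqa_pairs = [
    ("images/00000001_000.png", "Does this chest X-ray show cardiomegaly?", "Yes."),
]

optimizer = torch.optim.AdamW(model.parameters(), lr=1e-4)

for image_path, question, answer in vqa_pairs:
    image = Image.open(image_path).convert("RGB")
    # LLaVA-1.5 chat format; the <image> token marks where vision features go.
    prompt = f"USER: <image>\n{question} ASSISTANT: {answer}"
    inputs = processor(images=image, text=prompt, return_tensors="pt").to(model.device)

    # Simplified labeling: train on the whole sequence, masking only image tokens.
    labels = inputs["input_ids"].clone()
    labels[labels == model.config.image_token_index] = -100

    loss = model(**inputs, labels=labels).loss
    loss.backward()
    optimizer.step()
    optimizer.zero_grad()
```

A real pipeline would batch examples, mask the question portion of the labels so loss is computed only on answers, and hold out a validation split of VQA pairs to check for overfitting on so small a dataset.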
