ARCHIVES

Original Article

Agentic AI Based Smart Assistant: A Multimodal Visual Question Answering System Using Fast API and GROQ Vision-Language Models

Gaurav Arya1 Shuchi Sharma2
1 Student, Department of AIML, ADGIPS, FC-26 Shastri Park, Shahdara, New Delhi, India. 2 Assistant Professor, Department of AIML, ADGIPS, FC-26 Shastri Park, Shahdara, New Delhi, India.

Published Online: May-August 2026

Pages: 237-240

Abstract

This paper presents the design and implementation of an Agentic AI Based Smart Assistant — a multimodal web application capable of analyzing images and answering natural language queries in real time. The system integrates FastAPI as the backend web framework with the GROQ API, leveraging LLaMA-based Vision-Language Models (VLMs) to interpret both visual and textual data simultaneously. An additional Retrieval-Augmented Generation (RAG) pipeline using TF-IDF vectorization enables document-aware question answering from uploaded PDFs. The system was tested across ten functional scenarios including valid and invalid inputs, large images, concurrent requests, and API fault conditions — all passing successfully. Results demonstrate strong contextual accuracy and low-latency performance suitable for real-world applications in medical imaging, smart education, and automated inspection.

Related Articles

2026

Artificial Intelligence in Learning and Teaching

2026

Admin Assist: An AI – Driven Configuration and Orchestration for Enterprise Application

2026

Enhancing Blood Group Identification using pigeon inspired optimization: An Innovative Approach

2026

Eco-Genius: Power Up Smart, Power Down Waste

2026

Crowd-Sourced Disaster Response and Rescue Assistant

2026

Unveiling Deepfake Detection Using Vision Transformers: A Survey and Experimental Study

Share Article

X
LinkedIn
Facebook
WhatsApp

Or copy link

https://test.indjcst.com/archives/10.59256/indjcst.20260502026

*Instagram doesn't support direct link sharing from web. Copy the link and share it in your Instagram story or post.