Agentic AI Based Smart Assistant: A Multimodal Visual Question Answering System Using Fast API and GROQ Vision-Language Models

Gaurav Arya; Shuchi Sharma

doi:https://www.doi.org/10.59256/indjcst.20260502026

ARCHIVES

Original Article

Agentic AI Based Smart Assistant: A Multimodal Visual Question Answering System Using Fast API and GROQ Vision-Language Models

Gaurav Arya¹ Shuchi Sharma²

¹ Student, Department of AIML, ADGIPS, FC-²⁶ Shastri Park, Shahdara, New Delhi, India. ² Assistant Professor, Department of AIML, ADGIPS, FC-²⁶ Shastri Park, Shahdara, New Delhi, India.

Published Online: May-August 2026

Pages: 237-240

Cite this article

↗ https://www.doi.org/10.59256/indjcst.20260502026

Abstract

View PDF

This paper presents the design and implementation of an Agentic AI Based Smart Assistant — a multimodal web application capable of analyzing images and answering natural language queries in real time. The system integrates FastAPI as the backend web framework with the GROQ API, leveraging LLaMA-based Vision-Language Models (VLMs) to interpret both visual and textual data simultaneously. An additional Retrieval-Augmented Generation (RAG) pipeline using TF-IDF vectorization enables document-aware question answering from uploaded PDFs. The system was tested across ten functional scenarios including valid and invalid inputs, large images, concurrent requests, and API fault conditions — all passing successfully. Results demonstrate strong contextual accuracy and low-latency performance suitable for real-world applications in medical imaging, smart education, and automated inspection.

Quick Links

Download

Manuscript Template Copyright Form

Policies

Share Article

X

Facebook

Or copy link

https://test.indjcst.com/archives/10.59256/indjcst.20260502026

*Instagram doesn't support direct link sharing from web. Copy the link and share it in your Instagram story or post.

ARCHIVES

Agentic AI Based Smart Assistant: A Multimodal Visual Question Answering System Using Fast API and GROQ Vision-Language Models

Cite this article

Abstract

Related Articles

Artificial Intelligence in Learning and Teaching

Admin Assist: An AI – Driven Configuration and Orchestration for Enterprise Application

Enhancing Blood Group Identification using pigeon inspired optimization: An Innovative Approach

Eco-Genius: Power Up Smart, Power Down Waste

Crowd-Sourced Disaster Response and Rescue Assistant

Unveiling Deepfake Detection Using Vision Transformers: A Survey and Experimental Study

PlumX Metrics

Dimension

Quick Links

Download

Policies

Share Article