Agentic Chat
A minimal, tool-powered chatbot for web, files, vision, and audio.
Tool-augmented
Web + Files + Vision + Audio
Conversation + Tool Trace
Agent capabilities (tool-driven)
read_file_content— read local files (txt, csv, json, xlsx, code, etc.)python_executor— local computation & data analysis (pandas, numpy)vision_analyze— interpret images, charts, diagrams, screenshotsaudio_transcribe— transcribe local audio (Whisper)yt_transcript— fetch YouTube transcript textweb_search— search the web for fresh infovisit_webpage— fetch and read a page in fullfind_in_page— search inside a page or text
Tools used (last response)
Conversation (full)
Local setup guide
- Create a virtual environment and install dependencies.
python -m venv venv
source venv/bin/activate # Windows: venv\Scripts\activate
pip install -r requirements.txt
- Set up environment variables.
cp .env.example .env
Fill in your keys in .env.
- Launch the app.
python app.py
Then open http://localhost:7860.
Required keys
GROQ_API_KEY— LLM inference + audio/vision toolsSERPER_API_KEY— web search (web_search)
Optional
OPENAI_API_KEY— fallback LLMLANGFUSE_PUBLIC_KEY,LANGFUSE_SECRET_KEY(+LANGFUSE_HOST) — tracing
The agent will try these models in order (if keys are set):
openai/gpt-oss-120b— enabledmoonshotai/kimi-k2-instruct— enabledllama-3.3-70b-versatile— enabledmeta-llama/llama-4-maverick-17b-128e-instruct— enabledmeta-llama/llama-4-scout-17b-16e-instruct— enabledqwen/qwen3-32b— enabledllama-3.1-8b-instant— enabledopenai/gpt-4o-mini— missing OPENAI_API_KEY