ladybug11's picture
Update README.md
636ef7c verified

A newer version of the Gradio SDK is available: 6.1.0

Upgrade
metadata
title: AIQuoteClipGenerator
emoji: ๐ŸŽฌ
colorFrom: purple
colorTo: pink
sdk: gradio
sdk_version: 5.49.1
app_file: app.py
pinned: false
license: apache-2.0
short_description: AI-powered tool that automatically generates quote video
tags:
  - mcp-in-action-track-productivity

๐ŸŽฌ AI Quote Clip Generator

Autonomous MCP Agent โ€ข Trend-Aware Quote Studio โ€ข Multimodal Generation
Track 2: MCP in Action โ€“ Productivity

AI Quote Clip Generator is an MCP-powered autonomous system that creates aesthetic, trend-aware quote videos for TikTok, Instagram Reels, and Shorts.
It combines Gemini + OpenAI + ElevenLabs + Modal + Pexels into a single intelligent pipeline that plans, generates, narrates, and renders short-form content automatically.


๐Ÿ”ฎ Live Demo

โ–ถ๏ธ Click here to watch the demo video


๐Ÿš€ What It Does

With a single click, the system:

  • Generates non-repetitive Gemini-powered quotes
  • Applies a persona style (Coach, Philosopher, Poet, Mentor)
  • Uses trend-aware insight fusion for modern themes
  • Creates voice-over explanations (OpenAI โ†’ ElevenLabs)
  • Retrieves cinematic vertical footage from Pexels
  • Renders 7โ€“20 second videos using Modal
  • Saves all clips to a persistent gallery
  • Shows a full agent activity log

This makes the app a full AI-powered short-form content studio.


๐Ÿง  Why This Fits MCP Track 2 โ€“ Productivity

  • Uses a smolagents CodeAgent orchestrating multi-step workflows
  • Multiple tools are invoked as MCP-style functions
  • Executes LLM โ†’ video search โ†’ voice synthesis โ†’ rendering โ†’ caption generation
  • Incorporates a mini-RAG trend engine for contextual relevance
  • Demonstrates a high-value productivity workflow that automates real creative work
  • Fully multimodal: text โ†’ audio โ†’ video โ†’ metadata generation

๐Ÿ› ๏ธ MCP Tools Used

Tool Description
generate_quote_tool Produces unique, trend-aware quotes (Gemini primary)
search_pexels_video_tool Retrieves aesthetic Pexels footage
create_quote_video_tool Sends rendering jobs to Modal
(internal) generate_voice_commentary Voice-over commentary via OpenAI + ElevenLabs

All tools are orchestrated by an autonomous MCP-style agent chain.


๐Ÿ“Š Agent Pipeline Overview

  1. Build context โ†’ niche + persona + trend label
  2. Generate quote (Gemini primary, OpenAI fallback)
  3. Create voice-over commentary
  4. Retrieve video footage (Pexels)
  5. Render final video (Modal)
  6. Save in gallery + display caption/hashtags

๐Ÿงฉ Core Components

1. Autonomous MCP Agent Pipeline

Multi-step reasoning pipeline orchestrated with smolagents.

2. Gemini-Enhanced Quote Generator

Unique, non-repetitive quote generation with per-niche memory.

3. Trend-Aware Mini-RAG Engine

Fuses curated trend insights (Soft Life, Discipline Era, Glow-Up, etc.) into the content.

4. ElevenLabs Voice Studio

Creates natural voice-over narrations for short-form content.

5. Modal Render Engine

Fast cloud video rendering synced to narration length.

6. Pexels Multimodal Search Tool

Fetches cinematic vertical videos matched to niche/persona/trend.

7. Dynamic Aesthetic Text Layouts

Three design presets inspired by high-performing TikTok formats.

8. Persistent Video Gallery

Full history of generated clips stored in the app.


๐Ÿ”— Submission Post


๐Ÿง‘โ€๐Ÿ’ป Author

Meheret Egzerab


๐Ÿ“ License

apache-2.0