πŸ“ Persian ASR, Translation & Summarization

Welcome to the Persian Speech-to-Text & NLP platform! This app allows you to upload an audio file,
get an accurate transcription, and enhance the output with translation, summarization,
and punctuation restoration.

🎯 How It Works

1️⃣ Upload an audio file containing Persian speech. To transcribe YouTube videos, open this Colab Notebook. (If your request is flagged as coming from a bot, disconnect the runtime and run all cells again.)
2️⃣ Click "Transcribe" to generate the text output.
3️⃣ Use additional features: Translate, Summarize, or Restore Punctuation.
4️⃣ Customize settings: Select a language, AI model, and summary length.
5️⃣ View and copy the processed text for your use!

🌎 Select Language
🤖 Select AI Model

Powered by NVIDIA's NeMo Fast Conformer, this tool is optimized for high-quality Persian ASR (Automatic Speech Recognition).
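
For readers who want to try a NeMo ASR model outside this app, the following is a minimal sketch of loading a pretrained checkpoint and transcribing audio files. It is not this app's actual code: the checkpoint is not named in this text, so `model_name` is a placeholder the caller must supply, and `transcribe_files` and `hypothesis_text` are hypothetical helper names. The return type of `transcribe()` differs between NeMo versions, which the sketch tries to handle defensively.

```python
from typing import Any, List


def hypothesis_text(item: Any) -> str:
    """Return plain text from one transcription result.

    Depending on the NeMo version, transcribe() yields either plain strings
    or hypothesis objects that carry a .text attribute; accept both.
    """
    return item if isinstance(item, str) else str(getattr(item, "text", item))


def transcribe_files(model_name: str, audio_paths: List[str]) -> List[str]:
    """Load a pretrained NeMo ASR checkpoint and transcribe audio files.

    model_name is a placeholder (assumption): pass the name of whichever
    Persian Fast Conformer checkpoint you have access to.
    """
    # Imported lazily so this module can be loaded without NeMo installed.
    import nemo.collections.asr as nemo_asr

    model = nemo_asr.models.ASRModel.from_pretrained(model_name=model_name)
    results = model.transcribe(audio_paths)

    # Some NeMo versions return a (best_hypotheses, all_hypotheses) tuple
    # for transducer-style models; keep only the best hypotheses.
    if isinstance(results, tuple):
        results = results[0]

    return [hypothesis_text(r) for r in results]
```

Calling `transcribe_files("<your-checkpoint-name>", ["sample.wav"])` would return one transcript string per input file, assuming NeMo is installed and the checkpoint name resolves.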

📚 Trained on 800+ Hours of Speech Data:

  • Common Voice 17 (~300 hours)
  • YouTube (~400 hours)
  • NasleMana (~90 hours)
  • In-house dataset (~70 hours)
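
As a quick check of the "800+ hours" figure, the approximate per-corpus hours listed above can be summed. This is only a sketch: the numbers are the rounded values from the list, and `total_hours` and `shares` are illustrative helper names, not part of the app.

```python
# Approximate hours per corpus, copied from the list above.
DATASET_HOURS = {
    "Common Voice 17": 300,
    "YouTube": 400,
    "NasleMana": 90,
    "In-house dataset": 70,
}


def total_hours(hours: dict) -> int:
    """Sum the approximate hours across all corpora."""
    return sum(hours.values())


def shares(hours: dict) -> dict:
    """Return each corpus's share of the total, in percent (one decimal)."""
    total = total_hours(hours)
    return {name: round(100 * h / total, 1) for name, h in hours.items()}


if __name__ == "__main__":
    print(f"Total: ~{total_hours(DATASET_HOURS)} hours")
    for name, pct in shares(DATASET_HOURS).items():
        print(f"  {name}: {pct}%")
```

The listed figures add up to roughly 860 hours, which is consistent with the "800+" headline.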

📜 License & Business Inquiries

This application is licensed under Creative Commons Attribution-NonCommercial 4.0 (CC BY-NC 4.0).

  • 🛑 Non-Commercial Use Only – Commercial use is not permitted without prior approval.
  • 🔗 Attribution Required – Credit must be given to FAIM Group, Sharif University of Technology.
  • ❌ No Derivatives – Modifications or adaptations of this work are not allowed.

📜 Full License Details: CC BY-NC 4.0

📩 Business Inquiries:
If you're interested in commercial applications, please contact us at:
βœ‰οΈ Email: saeedzou2012@gmail.com