π Persian ASR, Translation & Summarization
Welcome to the Persian Speech-to-Text & NLP platform! This app allows you to upload an audio file,
get an accurate transcription, and enhance the output with translation, summarization,
and punctuation restoration.
π― How It Works
1οΈβ£ Upload an audio file containing Persian speech. To transcribe YouTube videos, open this Colab Notebook (If your request was detected as bot, disconnect and run all cells again).
2οΈβ£ Click "Transcribe" to generate the text output.
3οΈβ£ Use additional features: Translate, Summarize, or Restore Punctuation.
4οΈβ£ Customize settings: Select a language, AI model, and summary length.
5οΈβ£ View and copy the processed text for your use!
Powered by NVIDIAβs NeMo Fast Conformer, this tool is optimized for high-quality Persian ASR (Automatic Speech Recognition).
π Trained on 800+ Hours of Speech Data:
- Common Voice 17 (~300 hours)
- YouTube (~400 hours)
- NasleMana (~90 hours)
- In-house dataset (~70 hours)
π License & Business Inquiries
This application is licensed under Creative Commons Attribution-NonCommercial 4.0 (CC BY-NC 4.0).
- π Non-Commercial Use Only β Commercial use is not permitted without prior approval.
- π Attribution Required β Credit must be given to FAIM Group, Sharif University of Technology.
- β No Derivatives β Modifications or adaptations of this work are not allowed.
π Full License Details: CC BY-NC 4.0
π© Business Inquiries:
If you're interested in commercial applications, please contact us at:
βοΈ Email: saeedzou2012@gmail.com