Cutting-edge AI technology for automated audio transcription. A nice GUI for OpenAI's Whisper and pyannote (speaker identification).
Updated Jun 17, 2025 - Python
Pybind11 bindings for Whisper.cpp
The main repo for Stage Whisper — a free, secure, and easy-to-use transcription app for journalists, powered by OpenAI's Whisper automatic speech recognition (ASR) machine learning models.
A static site demonstrating real-time audio transcription via Amazon Transcribe over a WebSocket.
Record audio and save a transcription to your system's clipboard with ctranslate2 and faster-whisper.
WhisperClip simplifies your life by automatically transcribing audio recordings and saving the text directly to your clipboard. With just a click of a button, you can effortlessly convert spoken words into written text, ready to be pasted wherever you need it. This application harnesses the power of OpenAI’s Whisper for free.
Free speech to text
Transcribe any audio or video file. Edit and view your transcripts in a standalone HTML editor.
Uses the powerful WhisperS2T and CTranslate2 libraries to batch transcribe multiple files.
Streamlit Audio Transcription with OpenAI's Whisper AI: an interactive Streamlit app demonstrating real-time audio transcription using OpenAI's Whisper AI.
The Gemini API wrapper for Delphi utilizes advanced models developed by Google to provide robust capabilities, including interactive chat, text embeddings, code generation, image and video prompting, audio analysis and transcription, fine-tuning, caching, and integration with Google Search.
Transcription and annotation interface for recorded audio or video files
Generate subtitles for long movies and podcasts with the OpenAI Whisper API.
Speakscribe is a web application that allows users to transcribe audio using OpenAI and to interact with a chatbot. The web application is written in Python using NiceGUI.
Dockerized Whisper C++ speech-to-text API for easy deployment and rapid integration, offering the latest stable and nightly builds for efficient audio transcription.
The GroqCloud API wrapper for Delphi provides access to models from Meta, OpenAI, MistralAI and Google on Groq’s LPUs, offering chat, text generation, image analysis, audio transcription, JSON output, tool integration, and content moderation capabilities.
Scribe is a Python script that transcribes audio and video files using OpenAI Whisper and exports the transcriptions as PDF documents, enhanced by the gpt-3.5-turbo model.
OpenSceneSense is a Python library that harnesses AI for advanced video analysis, offering customizable frame and audio insights for dynamic applications in media, education, and content moderation.