Managing audio is a key part of recent enterprise. Zoom conferences, YouTube movies, and podcasts, as examples, contain listening, adjusting, and transcribing speech.
There’s no scarcity of subtle audio instruments. I’ve examined a number of them that declare to make use of synthetic intelligence. Listed here are my findings organized by the use case:
- Podcast instruments,
- Conferences, webinars,
- Textual content to audio.
AI-powered Audio Instruments
Transcribing. AudioNotes makes use of synthetic intelligence to transform speech into textual content. Add an audio file or converse to the instrument and it’ll transcribe and summarize. The transcription is on the market for 30 languages, however the summaries are in English solely. My checks for this submit have been in English.
AudioNotes saves all transcripts and summaries robotically in customers’ dashboards. Sadly, it doesn’t save the audio contained in the textual content. Customers can tag the notes to seek out them rapidly and might share them with different registered customers.
Use the instrument to create transcripts of movies or podcasts or to file concepts and descriptions. In a stay chat, an AudioNotes rep instructed me an iPhone app is coming in “two to 3 weeks.”
AudioNotes affords a free, restricted plan and a blizzard of paid plans below “Private,” “Professional,” and “PodNotes” classes. Every has a number of pricing fashions starting from $49 per yr to $249 monthly.
Recorder, a free Google app for Android gadgets, is a detailed various to AudioNotes.
Podcast instruments. Podcastle is a multi-feature instrument for creating higher podcasts. It makes use of AI to:
- Enhance audio high quality by eradicating background noises,
- Create podcast transcripts and descriptions,
- Detect and take away filler phrases — e.g., “um,” “ah,” “like,” “you understand.”
The free “Primary” plan consists of three hours of audio, restricted entry to the enhancing instruments, and a watermark on the transcriptions, amongst different limitations. Paid plans are “Storyteller” and “Professional” for $11.99 and $23.99 monthly. Each have in depth options and capability.
Shut alternate options to Podcastle are Auphonic, Descript, and Adobe Podcasts.
Conferences, webinars. Otter is an AI assistant that robotically generates assembly transcripts and summaries. Invite Otter to your conferences on Zoom, Microsoft Groups, or Google Meet. It can flip voices into textual content and seize slides.
Add feedback to the transcript and share together with your group. The abstract resembles a desk of contents: clicking sections will take you to that spot within the recorded audio, making recordings simple to navigate.
Otter can be a useful instrument to file and transcribe podcasts.
Otter’s free “Primary” plan consists of 300 month-to-month transcription minutes — half-hour per dialog. “Professional” and “Enterprise” plans value $8 and $20 monthly, billed yearly.
MeetGeek is a detailed various to Otter.
Textual content to audio. Murf is an AI voice generator to show textual content into speech. It’s helpful for creating video voiceovers — comparable to for YouTube and TikTok — and audio variations of articles. Paste the textual content into the instrument, and it’ll generate the audio.
Murf affords a number of voices — male, feminine, educator, developer, extra. The variations have been stark in my testing. Some voices have been significantly better than others. So pay attention to a couple earlier than choosing one. Then add pauses and alter the velocity and pitch as wanted.
Murf’s free plan consists of 10 minutes of voice technology and three customers. “Primary,” “Professional,” and “Enterprise” plans are $19, $26, and $99, per person monthly, billed yearly. Every affords progressively extra options.
Speechify is a detailed various to Murf.