Audio

96 results

Acoustic Model Training :

Improve speech recognition accuracy for your use case in regional dialects and domain-specific acoustical environments.

Learn more

Advanced Punctuation :

Speechmatics’ Advanced Punctuation is built on over 2.5 billion words and has an industry-leading set of supported punctuation marks. This use of punctuation optimises the speed and ease of consuming a transcript for human users.

Learn more

Agent Performance & Quality Monitoring :

Once you begin tracking agent performance metrics, you will be able to see the impacts on average handle time and hold time, as well as volume of calls, sentiment, and top themes handled for a single call or multiple calls.

Learn more

AI Predictive Analytics :

We train custom predictive models to detect complex events and the likelihood of future behaviour. VoiceBase AI takes a different approach to speech analytics, by leveraging patterns and data not recognizable to the human eye. We combined big voice…

Learn more

Analytics :

Global eCDN administration in a single, web-based interface. Pre-event simulations and insightful analytics let you easily assess the health of your video network before, during and after video events.

Learn more

Analyzing Conversations :

Businesses rely on Rev.ai to optimize and monitor conversations with customers. Speaker diarization lets you analyze and improve the quality of each interaction.

Learn more

Audio And Speech :

Audio transcription and categorization to power home agents and other voice-controlled devices.

Learn more

Audio Description :

3Play Media provides high quality, competitively priced audio description services for online video. Once your media files have been uploaded to our system, your audio description will be created by professional describers who utilize our unique wor…

Learn more

Audio & Music Identification :

Learn more

Auto-detect Language :

When you need to support multilingual scenarios, you can now specify two to four language codes and Cloud Speech-to-Text will identify the correct language spoken and provide the transcript.

Learn more

Automatic Punctuation :

Accurately punctuates transcriptions (e.g., commas, question marks, and periods) with machine learning.

Learn more

Automatic Sample Rate Detection :

Speechmatics’ ASR can determine the sample rate of each media file and applies the most appropriate transcription model to optimise accuracy.

Learn more

Automatic Speech Recognition :

Recognize speech using Watson’s powerful deep learning neural technologies to enable your applications in voice conversation, transcription and analytics.

Learn more

Automatic Speech Recognition :

Google Cloud Speech-to-Text enables developers to convert audio to text by applying powerful neural network models in an easy-to-use API. The API recognizes 120 languages and variants to support your global user base. You can enable voice command-an…

Learn more

Automatic Speech Recognition :

Learn more

Automatic Speech Recognition :

Most speech recognition systems output a string of text without punctuation. Amazon Transcribe uses deep learning to add punctuation and formatting automatically, so that the output is more intelligible and can be used without any further editing.

Learn more

Business Scenarios Built On Speech Services :

Easily transcribe every call and optimize results through batch transcription and custom speech services enhanced for call center scenarios. Index call transcriptions for full-text search, or apply text analytics to detect sentiment, language, and k…

Learn more

Captioning :

3Play Media delivers competitively priced closed captions and transcripts that are word-to-word synchronized and more than 99% accurate, even in cases of poor audio quality, multiple speakers, difficult content, and accents.

Learn more

Cataloging Archives :

Transcribe your audio and video archives to make them searchable and editable. Rev.ai helps you quickly locate specific words and phrases within time-stamped transcripts.

Learn more

Caching :

Save your network from bandwidth-intensive enterprise video by bringing the video closer to viewers. Intelligent video caching reduces the bandwidth used by live and on-demand video by 90% or more.

Learn more

Channel Diarisation :

Detect and label speakers on up to 6 streams or channels.

Learn more

Channel Identification :

Amazon Transcribe is able to process audio and video where each speaker is recorded on different channels. Contact centers stand to benefit significantly by submitting a single audio file to Amazon Transcribe, which will identify each channel and pr…

Learn more

CloudPBX :

The CloudPBX service is an advanced business phone system, instantly available and great for start-ups, small teams and medium-sized teams. The service operates on a highly scalable, feature-rich and reliable telecoms platform that gives you a phone…

Learn more

Collective AI :

Our vision is a powerful architecture for contribution and collaboration among developers, where they can easily extend the functionality of existing artificial intelligence domains. Collective AI™ offers comprehensive knowledge that is always l…

Learn more

Confidence Scores :

Visualise the confidence of every word within the transcript.

Learn more

Content Analysis :

Classify content and extract sentiment from employee and customer interactions to discover what is most important to employees and customers. Understand how they feel about company initiatives. EpiAnalytics makes decision making easier so you can t…

Learn more

Conversational Intelligence :

Amazon Lex provides automatic speech recognition and natural language understanding technologies to create a Speech Language Understanding system. Amazon Lex uses the same proven technology that powers Alexa. Amazon Lex is able to learn the many dif…

Learn more

Conversational Intelligence :

Learn more

Custom Dictionary And Sounds Feature :

Add a set of context-specific words to the dictionary to enhance your transcription accuracy. Learn more about the benefits of Custom Dictionary.

Learn more

Customized Speech Recognition :

Manually customize speech recognition for your business by specifying up to 5,000 words or phrases that are likely to be spoken (such as product names). Also automatically convert spoken numbers into addresses, years, or currencies, or do other conv…

Learn more

Customized Vocabulary :

Add words and phrases to adapt vocabulary for your business to improve transcription accuracy

Learn more

Custom Trigger Phrase :

Learn more

Custom Vocabulary :

Amazon Transcribe gives you the ability to expand and customize the speech recognition vocabulary. You can add new words to the base vocabulary and generate highly-accurate transcriptions specific to your use case, such as product names, domain-spec…

Learn more

Deep Meaning Understanding :

Ask multiple questions and filter results all at once. Get answers to your most complex questions anywhere: on any connected device, app, or website, at home or on the go.

For example, users can say “show me hotels in San Francisco for tomorrow that ar…

Learn more

Gender Identification :

Discover gender-based trends using gender identification

Learn more

Grammars Training :

Improve speech recognition accuracy for your use case by applying rules to recognize specific phrases, words, letters, numbers or lists.

Learn more

Inappropriate Content Filtering :

Filter inappropriate content in text results for some languages.

Learn more

Inbound Voice :

The Inbound Voice service allows voice calls from landlines and mobiles to be received by a telephone number and routed to a VoIP SIP endpoint of your choice. You can allocate to your account, and receive calls on, telephone numbers from the followi…

Learn more

Knowledge Extraction :

Surface actionable intelligence from calls, lectures and conferences. We make recorded content discoverable through Knowledge Extraction. VoiceBase automatically extracts keywords and generates overall topics from audio and video recordings to provi…

Learn more

Knowledge Graphs :

Learn more

Language Identification :

Automatically identify the language spoken for transcription

Learn more

Language Model Training :

Improve speech recognition accuracy for your use case in domain-specific terminology, acronyms, product names, jargon and expressions.

Learn more

Live Captioning :

3Play Media’s live automatic captioning service streamlines the traditional live captioning workflow by integrating with major video platforms to post live captions directly back to your stream.

Learn more

Low Latency Finals :

Low Latency Finals enable the most accurate real-time transcription by leveraging Speechmatics’ proprietary rescoring outputs. The feature is able to define the context of a transcript and automatically correct words to match the context.

Learn more

Model Selection :

Choose from a selection of four pre-built models: default, voice commands and search, phone calls, and video transcription.

Learn more

MRCP Connector :

The CereProc cServer MRCP Connector provides a complete standards-based TTS solution for IVR environments. IETF MRCP versions 1 and 2 are supported, greatly simplifying platform integration. CereProc has developed the world's most natural TTS voices…

Learn more

Multicast :

Multicasting is the most efficient way to stream live video. Instead of sending countless video streams across your network to reach every viewer, it distributes a single stream for all your viewers.

Learn more

Multichannel Recognition :

In multiparticipant recordings where each participant is recorded in a separate channel (e.g., phone call with two channels or video conference with four channels), Cloud Speech-to-Text will recognize each channel separately and then annotate the tr…

Learn more

Multilingual Text To Speech :

Learn more

Multiple Audio Transmission Choices :

Stream real-time audio directly from your application or upload pre-recorded audio files. Many audio formats in various states of file compression are supported.

Learn more

Natural Language Generation :

Learn more

Numeric Redaction :

Protect your users’ data by masking sensitive numeric data from your speech transcripts, such as credit card data, social security numbers and phone numbers.

Learn more

Real-time Audio Diagnostics :

Ask your user to come closer to the microphone due to background noise by analyzing the signal characteristics of your input audio in real-time.

Learn more

Real-time Streaming Or Prerecorded Audio Support :

Audio input can be streamed from an application’s microphone or sent from a prerecorded audio file (inline or through Google Cloud Storage). Multiple audio encodings are supported, including FLAC, AMR, PCMU, and Linear-16.

Learn more

Real-time Transcription :

Improve your NPS® and customer satisfaction outcomes using real-time transcription

Learn more

Recognize Multiple Speakers :

Amazon Transcribe is able to recognize when the speaker changes and attribute the transcribed text appropriately. This can significantly reduce the amount of work needed to transcribe audio with multiple speakers like telephone calls, meetings, and …

Learn more

Sentiment And Emotion Analysis :

Determine customer satisfaction through sentiment and emotion analysis

Learn more

Service Snapshot :

Convert text to lifelike speech. Speech stream delivered via email, HTTPS callback, URL, SIP or SMS.
Convert speech to text.
Allocate inbound telephone numbers.
Inbound voice to SIP endpoint.

Learn more

Smart Formatting :

More easily read dates, times, numbers, currency values, email and website addresses in your final transcripts by converting them into conventional forms.

Learn more

Automatic Subtitling :

At AmberScript, we believe that video and audio content should be accessible to everyone, including people with hearing impairments. Just because someone cannot hear does not mean that they cannot enjoy video…

Learn more

Spanish Captioning & Transcription Services :

We provide premium-quality closed captioning and transcription for Spanish-language prerecorded content. Our pricing is extremely competitive and accuracy is more than 99%, even in cases of poor audio quality, multiple speakers, difficu…

Learn more

Speaker Change :

Easily identify a change of speaker within your transcript with Speaker Change. A token is automatically added to the transcript each time a speaker change is detected. This enables easy modification of the transcript to improve its readability.

Learn more

Speaker Diarisation :

Detect and label different speakers within the same channel.

Learn more

Speaker Diarization :

Recognize who said what in a multi-participant voice exchange. Currently optimized for two-way call center conversations but can detect up to 6 different speakers.

Learn more

Speaker Diarization :

Analyze customer and agent voices separately in your transcripts

Learn more

Speaker Diarization :

Know who said what - you can now get automatic predictions about which of the speakers in a conversation spoke each utterance.

Learn more

Speaker Identification :

Identify who is speaking. The API can be used to determine the identity of an unknown speaker. Input audio of the unknown speaker is paired against a group of selected speakers, and in the case there is a match found, the speaker’s identity is retur…

Learn more

Speaker Verification :

Use your voice for verification. The API can be used to power applications with an intelligent verification tool. If a speaker claims a certain identity, their voice can be used to verify the claim.

Learn more

Speech Analytics :

Evaluate and categorize calls at scale to build scorecards, reports, dashboards and KPIs.

We created VoiceBase Speech Analytics with a revolutionary query and categorization solution. Now analysts can inspect calls with previously unattainable gran…

Learn more

Speech Recognition :

Turnaround time is 3x the length of your audio file (1 minute of audio takes 3 minutes to transcribe).

Learn more

Speech Recognition :

Voci deep learning speech recognition goes beyond earlier call center technologies. Our solutions feature proprietary machine learning and deep neural networks that ingest and “learn” voice data, enabling your call center to identify customer behavi…

Learn more

Speech Recognition :

Learn more

Speech Recognition :

Our automatic speech recognition (ASR) converts spoken word into text with best-in-class accuracy.
Automatically punctuate (commas, question marks, periods, etc.) and capitalize for an easy-to-read transcript.
Receive a timestamp for each word.

Learn more

Speech-to-Meaning :

Understand the meaning, not just the words. The Speech-to-Meaning™ engine delivers unprecedented speed and accuracy and can be integrated with mobile applications, cloud software, connected devices and, ultimately, the Internet of Things.

Learn more

Speech-to-text :

EpiAnalytics automatically provides structure for your call recordings whether you are monitoring calls for sales or support issues. Listen to every call the same way a person would listen to a call or audio recording. Our highly scalable solution …

Learn more

Speech To Text :

Transcribe large volumes of recorded audio quickly via lightning-fast GPUs

Learn more

Speech To Text :

Converts spoken audio to text for intuitive interaction.

Easily add real-time speech-to-text capabilities to your applications for scenarios like voice commands, conversation transcription, and call center log analysis.

Tailor your speech recogniti…

Learn more

Speech To Text :

The Speech-to-Text service allows an application to have a speech-to-text (STT) conversion performed on a long or short audio stream and for the speech in that audio stream to be transcribed as text. This service can be used in interactive systems (…

Learn more

Speech To Text :

Transcribe audio and video recordings with deep learning speech recognition. VoiceBase was conceived to do for voice what Google did for text: make it instantly searchable and shareable by creating a rich, queryable database. Our effective speech an…

Learn more

Speech Translation :

Give your app real-time speech translation capabilities in any of the supported languages and receive either a text or speech translation back. Speech Translation models are based on leading-edge speech recognition and neural machine translation (NM…

Learn more

Streaming Transcription :

With Amazon Transcribe, you can transcribe audio to text in real time. Using a secure connection over the HTTP/2 protocol, you can send a live audio stream to the service and receive a stream of text in return.

Learn more

Supports Context And Follow‑up :

Use context, such as the user's location or previous queries, to support natural interactions.

For example, users can say “show me hotels around me,” “how about in San Francisco,” “nothing more than $200 per night.”

Learn more

Text To Speech :

Give natural voice to your apps.

Build smart apps and services that speak to users naturally with the Text to Speech service. Convert text to audio in near real time, and adjust the speed of speech, pitch, volume, and more.

Give your applicat…

Learn more

Text To Speech :

Google Cloud Text-to-Speech enables developers to synthesize natural-sounding speech with 100+ voices, available in multiple languages and variants. It applies DeepMind’s groundbreaking research in WaveNet and Google’s powerful neural networks to de…

Learn more

Text To Speech :

Amazon Polly provides an API that enables you to quickly integrate speech synthesis into your application. You simply send the text you want converted into speech to the Amazon Polly API, and Amazon Polly immediately returns the audio stream to your…

Learn more

Text To Speech :

With Watson Text-to-Speech, you can generate human-like audio from written text. Improve the customer experience and engagement by interacting with users in multiple languages and tones. Increase content accessibility for users with different abili…

Learn more

Text To Speech :

Text-O-Phone (ToP) is the complete, flexible, easy-to-integrate multilingual front-end solution for text-to-speech systems developed by CELI. It covers the complete processing pipeline from standard text to phonetic annotation – including stress and…

Learn more

Text To Speech Server :

The CereProc cServer is a mature, stable speech server platform. It is supported on Windows and Linux, offering high performance and availability in multi-threaded, multi-channel environments.

Learn more

Timestamp Generation :

Amazon Transcribe returns a timestamp for each word, so that you can easily locate the audio in the original recording by searching for the text.

Learn more

Transcription :

Customized transcription services that can meet any project specifications, including specialized formatting and recurring delivery schedules.

Learn more

Transcription :

3Play Media provides premium transcription services with a guaranteed accuracy rate of 99%, even in cases of difficult audio, background noise, and accents.

Learn more

Automatic Transcription :

At AmberScript, we believe that video and audio content should be accessible to everyone, including people with hearing impairments. Just because someone cannot hear does not mean that they cannot enjoy video…

Learn more

Translation :

Translate your audio, video or text files into any foreign language, quickly and accurately.

Learn more

Translation And Subtitling :

After your video or audio files have been captioned (or transcripts aligned), we make it easy to translate them into many different languages. Our translation services are seamlessly integrated with our captioning and transcription services.

Learn more

Voice To Text Transcription :

Clarabridge helps you transcribe audio recordings into text and then immediately into structured, reportable data. Clarabridge’s Voice Transcription Service uses a patented chip-based algorithm to transcribe voice of the customer data 3,000 times fa…

Learn more

Word Spotting And Filtering :

Filter for specific words or inappropriate content by using our keyword spotting and profanity filtering features (US English only).

Learn more