whisper-tiny-en Beta
Automatic Speech Recognition • OpenAIWhisper is a pre-trained model for automatic speech recognition (ASR) and speech translation. Trained on 680k hours of labelled data, Whisper models demonstrate a strong ability to generalize to many datasets and domains without the need for fine-tuning. This is the English-only version of the Whisper Tiny model which was trained on the task of speech recognition.
Usage
Workers - TypeScript
curl
Parameters
Input
-
0
string -
1
object-
audio
arrayAn array of integers that represent the audio data constrained to 8-bit unsigned integer values
-
items
numberA value between 0 and 255
-
-
source_lang
stringThe language of the recorded audio
-
target_lang
stringThe language to translate the transcription into. Currently only English is supported.
-
Output
-
text
stringThe transcription
-
word_count
number -
words
array-
items
object-
word
string -
start
numberThe second this word begins in the recording
-
end
numberThe ending second when the word completes
-
-
-
vtt
string
API Schemas
The following schemas are based on JSON Schema