WhisperUI is a Speech to Text service built on OpenAI Whisper, a state-of-the-art Automatic Speech Recognition (ASR) system. The platform allows users to convert their audio files into text or SRT files, making it useful for a variety of applications like transcription services, subtitle generation, or linguistic analysis.. WhisperUI supports a broad range of file types including MP3, MP4, MPEG, MPGA, M4A, WAV, and WEBM, with a maximum file size limit set by OpenAI. The Whisper system derives its robustness from having been trained on a comprehensive and diversified data set that includes multilingual and multitask supervised data obtained from the web.. This ensures impressive performance against various accents, background noise, and technical language. Furthermore, Whisper can transcribe speech in multiple languages and translate them into English.. The transcription process begins when a user uploads an audio file to the WhisperUI web application, which then uses OpenAI Whisper to transform the spoken words into text.. The transcribed text is then made available to the user for review and modification. Users need an active OpenAI API Key to use the service, with billing handled directly by OpenAI based on the number of tokens used.. A premium feature set, which includes the ability to upload multiple files at once and daily unlimited uploads, is also available..