Commit d0770f4

Update README.md
1 parent ef085fd commit d0770f4

1 file changed

+2
-2
lines changed

README.md

@@ -62,7 +62,7 @@ The definitive, open-source Swift framework for interfacing with generative AI.
 * [Vision: Image-to-Text](#vision-image-to-text)
 * [DALLE-3 Image Generation](#dalle-3-image-generation)
 * [Audio](#audio)
-* [Audio Transcription](#audio-transcription)
+* [Audio Transcription: Whisper](#audio-transcription-whisper)
 * [Audio Generation: OpenAI](#audio-generation-openai)
 * [Audio Generation: ElevenLabs](#audio-generation-elevenlabs)
 * [Text Embeddings](#text-embeddings)
@@ -300,7 +300,7 @@ if let imageURL = images.first?.url {
 ## Audio
 Adding audio generation and transcription to mobile apps is becoming increasingly important as users grow more comfortable speaking directly to apps for responses or having their audio input transcribed efficiently. Preternatural enables seamless integration with these cutting-edge, continually improving AI technologies.

-### Audio Transcription
+### Audio Transcription: Whisper
 [Whisper](https://openai.com/index/whisper/), created and open-sourced by OpenAI, is an Automatic Speech Recognition (ASR) system trained on 680,000 hours of mostly English audio content collected from the web. This makes Whisper particularly impressive at transcribing audio with background noise and varying accents compared to its predecessors. Another notable feature is its ability to transcribe audio with correct sentence punctuation.

 ```swift
