GitHub - k2-fsa/sherpa-onnx at v1.10.14

3 Branches 130 Tags

Name	Name	Last commit message	Last commit date
Latest commit csukuangfj Support whisper large/large-v1/large-v2/large-v3 and distil-large-v2 (#… Jul 12, 2024 117cd7b · Jul 12, 2024 History 689 Commits
.github	.github	Support whisper large/large-v1/large-v2/large-v3 and distil-large-v2 (#…	Jul 12, 2024
android	android	Enable to stop TTS generation (#1041 )	Jun 22, 2024
c-api-examples	c-api-examples	Build sherpa-onnx as a single shared library (#1078 )	Jul 6, 2024
cmake	cmake	Support whisper large/large-v1/large-v2/large-v3 and distil-large-v2 (#…	Jul 12, 2024
dart-api-examples	dart-api-examples	Support whisper large/large-v1/large-v2/large-v3 and distil-large-v2 (#…	Jul 12, 2024
dotnet-examples	dotnet-examples	Add keyword spotting for C# (#1105 )	Jul 10, 2024
ffmpeg-examples	ffmpeg-examples	Build sherpa-onnx as a single shared library (#1078 )	Jul 6, 2024
flutter-examples	flutter-examples	Support whisper large/large-v1/large-v2/large-v3 and distil-large-v2 (#…	Jul 12, 2024
flutter	flutter	Support whisper large/large-v1/large-v2/large-v3 and distil-large-v2 (#…	Jul 12, 2024
go-api-examples	go-api-examples	Support onnxruntime 1.18.0 (#906 )	Jul 10, 2024
ios-swift	ios-swift	Update onnxruntime from v1.18.0 to v1.18.1 (#1107 )	Jul 11, 2024
ios-swiftui	ios-swiftui	Update onnxruntime from v1.18.0 to v1.18.1 (#1107 )	Jul 11, 2024
java-api-examples	java-api-examples	Support onnxruntime 1.18.0 (#906 )	Jul 10, 2024
kotlin-api-examples	kotlin-api-examples	Support onnxruntime 1.18.0 (#906 )	Jul 10, 2024
mfc-examples	mfc-examples	Support onnxruntime 1.18.0 (#906 )	Jul 10, 2024
nodejs-addon-examples	nodejs-addon-examples	Support whisper large/large-v1/large-v2/large-v3 and distil-large-v2 (#…	Jul 12, 2024
nodejs-examples	nodejs-examples	Support onnxruntime 1.18.0 (#906 )	Jul 10, 2024
python-api-examples	python-api-examples	Support onnxruntime 1.18.0 (#906 )	Jul 10, 2024
scripts	scripts	Support whisper large/large-v1/large-v2/large-v3 and distil-large-v2 (#…	Jul 12, 2024
sherpa-onnx	sherpa-onnx	Support whisper large/large-v1/large-v2/large-v3 and distil-large-v2 (#…	Jul 12, 2024
swift-api-examples	swift-api-examples	Add timestamps about streaming models for Swift API (#1113 )	Jul 12, 2024
toolchains	toolchains	Support RISC-V (#609 )	Feb 25, 2024
wasm	wasm	Inverse text normalization API of streaming ASR for various programmi…	Jun 18, 2024
.clang-format	.clang-format	add java wrapper suppport (#117 )	Apr 15, 2023
.clang-tidy	.clang-tidy	Support clang-tidy (#1034 )	Jun 19, 2024
.flake8	.flake8	add offline websocket server/client (#98 )	Mar 29, 2023
.gitignore	.gitignore	Support onnxruntime 1.18.0 (#906 )	Jul 10, 2024
CHANGELOG.md	CHANGELOG.md	Support whisper large/large-v1/large-v2/large-v3 and distil-large-v2 (#…	Jul 12, 2024
CMakeLists.txt	CMakeLists.txt	Support whisper large/large-v1/large-v2/large-v3 and distil-large-v2 (#…	Jul 12, 2024
CPPLINT.cfg	CPPLINT.cfg	Use static libraries for MFC examples (#210 )	Jul 13, 2023
LICENSE	LICENSE	Use standard apache 2.0 license (#53 )	Feb 22, 2023
MANIFEST.in	MANIFEST.in	Fix building wheels from source. (#632 )	Mar 4, 2024
README.md	README.md	Fix Flutter TTS example for iOS (#1090 )	Jul 8, 2024
build-aarch64-linux-gnu.sh	build-aarch64-linux-gnu.sh	Fix the alsa-lib version to v1.2.12 (#1048 )	Jun 23, 2024
build-android-arm64-v8a.sh	build-android-arm64-v8a.sh	Support onnxruntime 1.18.0 (#906 )	Jul 10, 2024
build-android-armv7-eabi.sh	build-android-armv7-eabi.sh	Support onnxruntime 1.18.0 (#906 )	Jul 10, 2024
build-android-x86-64.sh	build-android-x86-64.sh	Support onnxruntime 1.18.0 (#906 )	Jul 10, 2024
build-android-x86.sh	build-android-x86.sh	Support onnxruntime 1.18.0 (#906 )	Jul 10, 2024
build-arm-linux-gnueabihf.sh	build-arm-linux-gnueabihf.sh	Fix the alsa-lib version to v1.2.12 (#1048 )	Jun 23, 2024
build-ios-no-tts.sh	build-ios-no-tts.sh	Update onnxruntime from v1.18.0 to v1.18.1 (#1107 )	Jul 11, 2024
build-ios-shared.sh	build-ios-shared.sh	Update onnxruntime from v1.18.0 to v1.18.1 (#1107 )	Jul 11, 2024
build-ios.sh	build-ios.sh	Update onnxruntime from v1.18.0 to v1.18.1 (#1107 )	Jul 11, 2024
build-riscv64-linux-gnu.sh	build-riscv64-linux-gnu.sh	Fix the alsa-lib version to v1.2.12 (#1048 )	Jun 23, 2024
build-swift-macos.sh	build-swift-macos.sh	Fix CI errors. (#993 )	Jun 12, 2024
build-wasm-simd-asr.sh	build-wasm-simd-asr.sh	Add WebAssembly for ASR (#604 )	Feb 23, 2024
build-wasm-simd-kws.sh	build-wasm-simd-kws.sh	small fixes to wasm kws. (#672 )	Mar 18, 2024
build-wasm-simd-nodejs.sh	build-wasm-simd-nodejs.sh	return timestamps for WebAssembly (#737 )	Apr 5, 2024
build-wasm-simd-tts.sh	build-wasm-simd-tts.sh	Add WebAssembly for ASR (#604 )	Feb 23, 2024
release.sh	release.sh	Publish pre-compiled libs for Android. (#217 )	Jul 15, 2023
setup.py	setup.py	Support spoken language identification with whisper (#694 )	Mar 24, 2024

Repository files navigation

Supported functions

Speech recognition	Speech synthesis	Speaker verification	Speaker identification
✔️	✔️	✔️	✔️

Spoken Language identification	Audio tagging	Voice activity detection	Keyword spotting
✔️	✔️	✔️	✔️

Supported platforms

Architecture	Android	iOS	Windows	macOS	linux
x64	✔️		✔️	✔️	✔️
x86	✔️		✔️
arm64	✔️	✔️	✔️	✔️	✔️
arm32	✔️				✔️
riscv64					✔️

Supported programming languages

C++	C	Python	C#	Java	JavaScript	Kotlin	Swift	Go	Dart
✔️	✔️	✔️	✔️	✔️	✔️	✔️	✔️	✔️	✔️

It also supports WebAssembly.

Introduction

This repository supports running the following functions locally

Speech-to-text (i.e., ASR); both streaming and non-streaming are supported
Text-to-speech (i.e., TTS)
Speaker identification
Speaker verification
Spoken language identification
Audio tagging
VAD (e.g., silero-vad)
Keyword spotting

on the following platforms and operating systems:

x86, x86_64, 32-bit ARM, 64-bit ARM (arm64, aarch64), RISC-V (riscv64)
Linux, macOS, Windows, openKylin
Android, WearOS
iOS
NodeJS
WebAssembly
Raspberry Pi
RV1126
LicheePi4A
VisionFive 2
旭日X3派
etc

with the following APIs

C++, C, Python, Go, C#
Java, Kotlin, JavaScript
Swift
Dart

Links for pre-built Android APKs

Description	URL	中国用户
Streaming speech recognition	Address	点此
Text-to-speech	Address	点此
Voice activity detection (VAD)	Address	点此
VAD + non-streaming speech recognition	Address	点此
Two-pass speech recognition	Address	点此
Audio tagging	Address	点此
Audio tagging (WearOS)	Address	点此
Speaker identification	Address	点此
Spoken language identification	Address	点此
Keyword spotting	Address	点此

Links for pre-built Flutter APPs

Real-time speech recognition

Description	URL	中国用户
Streaming speech recognition	Address	点此

Text-to-speech

Description	URL	中国用户
Android (arm64-v8a, armeabi-v7a, x86_64)	Address	点此
Linux (x64)	Address	点此
macOS (x64)	Address	点此
macOS (arm64)	Address	点此
Windows (x64)	Address	点此

Note: You need to build from source for iOS.

Links for pre-trained models

Description	URL
Speech recognition (speech to text, ASR)	Address
Text-to-speech (TTS)	Address
VAD	Address
Keyword spotting	Address
Audio tagging	Address
Speaker identification (Speaker ID)	Address
Spoken language identification (Language ID)	See multi-lingual Whisper ASR models from Speech recognition
Punctuation	Address

Useful links

Documentation: https://k2-fsa.github.io/sherpa/onnx/
Bilibili 演示视频: https://search.bilibili.com/all?keyword=%E6%96%B0%E4%B8%80%E4%BB%A3Kaldi

How to reach us

Please see https://k2-fsa.github.io/sherpa/social-groups.html for 新一代 Kaldi 微信交流群 and QQ 交流群.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Supported functions

Supported platforms

Supported programming languages

Introduction

Links for pre-built Android APKs

Links for pre-built Flutter APPs

Real-time speech recognition

Text-to-speech

Links for pre-trained models

Useful links

How to reach us

About

Releases 127

Contributors 125

Languages

License

k2-fsa/sherpa-onnx

Folders and files

Latest commit

History

Repository files navigation

Supported functions

Supported platforms

Supported programming languages

Introduction

Links for pre-built Android APKs

Links for pre-built Flutter APPs

Real-time speech recognition

Text-to-speech

Links for pre-trained models

Useful links

How to reach us

About

Topics

Resources

License

Stars

Watchers

Forks

Releases 127

Contributors 125

Languages