
Ebranchformer #1951

Merged
merged 5 commits into from
Mar 4, 2025
Conversation

KarelVesely84
Contributor

Hello Fangyun @csukuangfj,
I have extended sherpa-onnx to support our EBranchformer encoder implementation,
which we currently use widely at Brno University of Technology.

The EBranchformer code is based on the Conformer model from transformers, but the internals are different:
https://github.com/BUTSpeechFIT/huggingface_asr/blob/streaming_karel/src/models/encoders/e_branchformer.py

This allows us to pre-train the encoder with the BestRQ algorithm and then fine-tune it with a modified icefall.
We would like to deploy it as a production system for streaming ASR (it already works for me locally).

It would therefore be good for us to have the support directly inside sherpa-onnx,
so that we can use the official sherpa-onnx builds.

On the other hand, it is a rather specific model, not yet widespread.
Would you agree to accept this extension into the codebase?
Or should we rely on our custom builds?

The encoder assumes a slightly different preset of input features, derived from
Speech2TextFeatureExtractor, hence the newly surfaced FBANK options:
normalize_samples and snip_edges.

Best regards,
Karel

- so EBranchformer feature extraction can be configured from Python
- the GlobCmvn is not needed, as it is a module in the OnnxEncoder
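The snip_edges option mentioned above follows Kaldi-style framing semantics. A minimal sketch of how it changes the number of emitted frames (assuming the standard Kaldi frame-counting rules; this is an illustration, not the actual sherpa-onnx code):

```cpp
#include <cstdint>

// Kaldi-style frame counting controlled by snip_edges.
// snip_edges = true  : count only frames that fit entirely inside the signal.
// snip_edges = false : frames are centered and edges padded, so the count
//                      depends only on the frame shift (rounded).
int32_t NumFrames(int32_t num_samples, int32_t frame_length,
                  int32_t frame_shift, bool snip_edges) {
  if (snip_edges) {
    if (num_samples < frame_length) return 0;
    return 1 + (num_samples - frame_length) / frame_shift;
  }
  return (num_samples + frame_shift / 2) / frame_shift;
}
```

With a 25 ms window (400 samples) and 10 ms shift (160 samples), a 400-sample signal yields 1 frame with snip_edges=true but 3 frames with snip_edges=false, which is why a model trained on one setting needs the same setting at inference time.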
@csukuangfj
Collaborator

Thanks! Will review it today.

Collaborator

@csukuangfj left a comment


Thanks!

Looks great to me. Left only some minor comments.

@@ -48,7 +48,9 @@ std::string FeatureExtractorConfig::ToString() const {
os << "feature_dim=" << feature_dim << ", ";
os << "low_freq=" << low_freq << ", ";
os << "high_freq=" << high_freq << ", ";
os << "dither=" << dither << ")";
os << "dither=" << dither << ", ";
os << "normalize_samples=" << normalize_samples << ", ";
Collaborator


Suggested change
os << "normalize_samples=" << normalize_samples << ", ";
os << "normalize_samples=" << (normalize_samples ? "True" : "False") << ", ";

os << "dither=" << dither << ")";
os << "dither=" << dither << ", ";
os << "normalize_samples=" << normalize_samples << ", ";
os << "snip_edges=" << snip_edges << ")";
Collaborator


Suggested change
os << "snip_edges=" << snip_edges << ")";
os << "snip_edges=" << (snip_edges ? "True" : "False") << ")";
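The two suggestions above apply the same pattern: stream the bools as the Python-style literals "True"/"False" instead of 1/0, so the printed config matches what a Python user would pass in. A standalone sketch of the resulting ToString (simplified; the real FeatureExtractorConfig has more fields):

```cpp
#include <sstream>
#include <string>

// Simplified mock of FeatureExtractorConfig::ToString with the
// reviewer's bool-to-"True"/"False" formatting applied.
std::string ConfigToString(bool normalize_samples, bool snip_edges) {
  std::ostringstream os;
  os << "FeatureExtractorConfig(";
  os << "normalize_samples=" << (normalize_samples ? "True" : "False") << ", ";
  os << "snip_edges=" << (snip_edges ? "True" : "False") << ")";
  return os.str();
}
```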

@@ -34,6 +51,9 @@ void PybindOnlineStream(py::module *m) {
py::arg("sample_rate"), py::arg("waveform"), kAcceptWaveformUsage,
py::call_guard<py::gil_scoped_release>())
.def("input_finished", &PyClass::InputFinished,
py::call_guard<py::gil_scoped_release>())
.def("get_frames", &PyClass::GetFrames,
py::arg("frame_index"), py::arg("n"),
Collaborator


Suggested change
py::arg("frame_index"), py::arg("n"),
py::arg("frame_index"), py::arg("n"), kGetFramesUsage

so that if you use help(OnlineStream.get_frames), you can view the help info in Python.

@@ -92,6 +96,8 @@ std::unique_ptr<OnlineTransducerModel> OnlineTransducerModel::Create(
const auto &model_type = config.model_type;
if (model_type == "conformer") {
return std::make_unique<OnlineConformerTransducerModel>(config);
} else if (model_type == "ebranchformer") {
Collaborator


Can you also update

if (model_type == "conformer") {
return std::make_unique<OnlineConformerTransducerModel>(mgr, config);

@@ -115,6 +121,8 @@ std::unique_ptr<OnlineTransducerModel> OnlineTransducerModel::Create(
switch (model_type) {
case ModelType::kConformer:
return std::make_unique<OnlineConformerTransducerModel>(config);
case ModelType::kEbranchformer:
Collaborator


Can you also update

case ModelType::kConformer:
return std::make_unique<OnlineConformerTransducerModel>(mgr, config);

It is for Android and HarmonyOS.
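The point of the two comments above is that the factory has two Create() overloads (a plain one and one taking an asset manager for Android/HarmonyOS), and a new model_type must be added to both dispatch sites or the mobile builds fall through to the error branch. A simplified mock of that pattern (not the actual sherpa-onnx code; AssetManager here is a hypothetical stand-in):

```cpp
#include <memory>
#include <string>

struct AssetManager {};  // stand-in for the Android/HarmonyOS asset manager

struct Model {
  std::string name;
};

// Desktop overload: dispatch on model_type.
std::unique_ptr<Model> Create(const std::string &model_type) {
  if (model_type == "conformer") {
    return std::make_unique<Model>(Model{"conformer"});
  } else if (model_type == "ebranchformer") {
    return std::make_unique<Model>(Model{"ebranchformer"});
  }
  return nullptr;  // unknown model type
}

// Mobile overload: same dispatch table; in sherpa-onnx it additionally
// loads the model files through the asset manager.
std::unique_ptr<Model> Create(AssetManager * /*mgr*/,
                              const std::string &model_type) {
  return Create(model_type);
}
```

Keeping both overloads in sync is exactly the review request: the ebranchformer branch added to the first Create() also has to appear in the mgr-taking one.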

@csukuangfj
Collaborator

(Please ignore the failed CI tests.)

@KarelVesely84
Contributor Author

KarelVesely84 commented Mar 4, 2025

OK, good, thank you for the feedback.
The remarks are now integrated into the PR code.

Collaborator

@csukuangfj left a comment


Thank you for your contribution!

@csukuangfj csukuangfj merged commit 7740dbf into k2-fsa:master Mar 4, 2025
163 of 214 checks passed