feat!: major performance & accuracy improvements in speech-to-text module by IgorSwat · Pull Request #1132 · software-mansion/react-native-executorch

IgorSwat · 2026-05-08T08:26:37Z

Description

This PR introduces several changes to the speech-to-text module based on Whisper models:

CoreML integration - models re-exported to CoreML backend, bringing significant performance upgrade for iOS devices.
New streaming algorithm - eliminates duplicates in streaming output, resulting in a major quality improvement of the live streaming mode.
Changes in demo apps: removed faulty 'voice mode' screen in LLM demo app, refactored speech to text screen in 'speech' app by adding new CoreML models to selection bar and changing the default model for iOS devices.
Minor code improvements in speech-to-text module

Introduces a breaking change?

Yes
No

Change: removes predefined constants for quantized models.
Justification: the quantized models differ very slightly from the original ones, introducing unnecessary complexity in this case.

Type of change

Bug fix (change which fixes an issue)
New feature (change which adds functionality)
Documentation update (improves or adds clarity to existing documentation)
Other (chores, tests, code style improvements etc.)

Tested on

iOS
Android

Testing instructions

Run demo app to test the live streaming mode.

Screenshots

Related issues

#1124

Checklist

I have performed a self-review of my code
I have commented my code, particularly in hard-to-understand areas
I have updated the documentation accordingly
My changes generate no new warnings

Additional notes

msluszniak · 2026-05-08T15:30:33Z

Also if this PR adds breaking change, please describe it directly below Introduces a breaking change? section in PR body.

msluszniak · 2026-05-19T15:06:19Z

Side note, after merging PR with TTS and rebasing, please make sure that native tests works here after all changes.

IgorSwat requested review from benITo47, chmjkb and msluszniak May 8, 2026 08:26

IgorSwat added model Issues related to exporting, improving, fixing ML models improvement PRs or issues focused on improvements in the current codebase labels May 8, 2026

msluszniak assigned IgorSwat May 8, 2026

msluszniak requested changes May 8, 2026

View reviewed changes

IgorSwat changed the title ~~feat: major performance & accuracy improvements in speech-to-text module~~ feat!: major performance & accuracy improvements in speech-to-text module May 8, 2026

msluszniak reviewed May 11, 2026

View reviewed changes

IgorSwat added 10 commits May 19, 2026 13:05

Optimal streaming algorithm

053d022

Revert back to 100ms refresh rate

913054e

Add CoreML whisper models

f1c0465

Update model urls

d6f3c90

Change default model for iOS devices

ccf7285

Add explicit timeout parameter

8291851

Concurrency fixes & automatic cleaunp

7fcf367

Update urls & audio-api

99f01a0

Apply review suggestions

2a22956

Rebase with main

a91344c

IgorSwat force-pushed the @is/speech-to-text-ultimate branch from c5d3c14 to a91344c Compare May 19, 2026 11:17

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat!: major performance & accuracy improvements in speech-to-text module#1132

feat!: major performance & accuracy improvements in speech-to-text module#1132
IgorSwat wants to merge 10 commits into
mainfrom
@is/speech-to-text-ultimate

IgorSwat commented May 8, 2026 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

msluszniak commented May 8, 2026

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

msluszniak commented May 19, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

IgorSwat commented May 8, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Introduces a breaking change?

Type of change

Tested on

Testing instructions

Screenshots

Related issues

Checklist

Additional notes

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

msluszniak commented May 8, 2026

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

msluszniak commented May 19, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

IgorSwat commented May 8, 2026 •

edited

Loading