Add support for partial transcription prefix in the prompt#15449
Add support for partial transcription prefix in the prompt#15449azziko wants to merge 3 commits intoNVIDIA-NeMo:mainfrom
Conversation
Signed-off-by: azziko <sharipov.wdev@gmail.com>
|
Thank you for a very clean usage example! Does this approach work well with the pretrained canary-v2, or did you train your own model with some modifications for streaming? If it's possible to share any numbers, I'd be curious to learn more. Can you add the tests either to |
Signed-off-by: azziko <sharipov.wdev@gmail.com>
Signed-off-by: azziko <azziko@users.noreply.github.com>
|
Thank you for a quick review! For my purposes and tests I have been using the pretrained canary-v2 model. My decoding parameters were as follows(let me know if you would like to know some specific numbers that I might have missed, I will happily share them too): |
|
Thanks. I was just wondering if you have any WER comparison to other approaches or models - I would have expected canary2 to degrade with this technique. |
Important
The
Update branchbutton must only be pressed in very rare occassions.An outdated branch is never blocking the merge of a PR.
Please reach out to the automation team before pressing that button.
What does this PR do ?
Add support for partial transcription of the current audio input. This is especially useful in the streaming scenarios.
Collection: [ASR]
Changelog
Usage
Can be used as an input propmt to the top level .transcribe() function. The partially transcribed part is ommited in the hypothesis. Must be used as the last turn:
GitHub Actions CI
The Jenkins CI system has been replaced by GitHub Actions self-hosted runners.
The GitHub Actions CI will run automatically when the "Run CICD" label is added to the PR.
To re-run CI remove and add the label again.
To run CI on an untrusted fork, a NeMo user with write access must first click "Approve and run".
Before your PR is "Ready for review"
Pre checks:
PR Type:
If you haven't finished some of the above items you can still open "Draft" PR.
Who can review?
Anyone in the community is free to review the PR once the checks have passed.
Contributor guidelines contains specific people who can review PRs to various areas.
Additional Information