For Canary variants on an AST task, would it be sufficient to train AST alone, or would ASR need to be trained as well? #15280
I added a new language, X, to Canary's tokenizer. I only need to translate X into English. Do I need to train X on the ASR task first?
Replies: 1 comment
Canary AST vs ASR training: great question!

Short answer: AST-only training can work, but joint training is better.

Why:

- Shared encoder benefits

When AST-only works:

When joint is better:

Config for AST-only:

```yaml
model:
  task: "ast"            # speech-to-text translation only
  freeze_encoder: false  # or true if the encoder is pre-trained
```

Config for joint:

```yaml
model:
  tasks: ["asr", "ast"]
  task_weights:
    asr: 0.5
    ast: 0.5
```

Recommendation:

If you have an ASR checkpo…

We train multilingual models at RevolutionAI. What's your source/target language pair?
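One way to read the joint config's `task_weights` is as per-step sampling probabilities: with `asr: 0.5` and `ast: 0.5`, roughly half of the training steps would draw an ASR example and half an AST example. The sketch below illustrates only that reading. `sample_tasks` is a hypothetical helper written for this example, not a NeMo API, and this thread does not confirm how Canary's data loader actually mixes tasks.

```python
import random
from collections import Counter

# Weights copied from the joint config in the reply.
TASK_WEIGHTS = {"asr": 0.5, "ast": 0.5}


def sample_tasks(task_weights, num_steps, seed=0):
    """Pick one task name per training step, in proportion to its weight.

    Hypothetical helper for illustration only.
    """
    rng = random.Random(seed)  # seeded so the schedule is reproducible
    tasks = list(task_weights)
    weights = [task_weights[t] for t in tasks]
    return rng.choices(tasks, weights=weights, k=num_steps)


schedule = sample_tasks(TASK_WEIGHTS, num_steps=1000)
counts = Counter(schedule)
print(counts)  # roughly even split between "asr" and "ast"
```

Under the same assumption, setting the weights to `{"asr": 0.0, "ast": 1.0}` reduces this to the AST-only case, which makes the two configs easy to compare on the same data.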