Learn Before
Diagnosing incorrect transcriptions in a speech recognition inference system
Case context: You are building a speech-to-text inference system. For a given audio input A, the system outputs transcription S_out, which is incorrect. The target correct transcription is S_star. The system uses beam search to approximate finding a transcription S that maximizes ScoreA(S). To diagnose the failure, you compute the scores and find that ScoreA(S_star) is greater than ScoreA(S_out), but the beam search returned S_out.
Question: Based on the provided scoring relationship, diagnose whether this is a search algorithm problem or a scoring function problem, and decide which development effort you should prioritize to fix this error.
Sample answer: Since ScoreA(S_star) is greater than ScoreA(S_out), the scoring function correctly assigned a higher score to the correct transcription S_star. However, the system still output S_out. This means the approximate search algorithm (beam search) failed to find the value S that maximizes the score, which is a search algorithm problem. Therefore, you should prioritize improving the search algorithm itself rather than the learning algorithm that estimates the score.
Key points:
- Identify that the scoring function correctly preferred the true output over the returned output because ScoreA(S_star) > ScoreA(S_out).
- Conclude that the beam search failed to find the higher-scoring correct transcription, which is a search algorithm problem.
- Decide to prioritize improving the search algorithm (beam search) rather than the learning algorithm.
Rubric: The response must identify the issue as a search algorithm problem based on the fact that the correct answer had a higher score but was not selected. It must also specify that the priority should be to improve the search algorithm.
0
1
References
Machine Learning Yearning (Deeplearning.ai)
Machine Learning Yearning (Deeplearning.ai)
Machine Learning Yearning (Deeplearning.ai)
Machine Learning Yearning (Deeplearning.ai)
Machine Learning Yearning (Deeplearning.ai)
Machine Learning Yearning (Deeplearning.ai)
Machine Learning Yearning (Deeplearning.ai)
Machine Learning Yearning (Deeplearning.ai)
Machine Learning Yearning (Deeplearning.ai)
Machine Learning Yearning (Deeplearning.ai)
Machine Learning Yearning (Deeplearning.ai)
Machine Learning Yearning (Deeplearning.ai)
Tags
Machine Learning
Deep Learning
Supervised Learning
Dive into Deep Learning @ D2L
Data Science
Machine Learning Strategy
Machine Learning Yearning @ DeepLearning.AI
Related
Search Algorithm Problem in Inference
Scoring Function Problem in Inference
Optimization Verification Test
When an inference system returns a wrong output, which two root causes should a developer distinguish between to debug effectively?
Identifying whether the search algorithm or scoring function caused an inference failure determines which component you should prioritize improving.
If the approximate search algorithm fails to find the value of S that maximizes Score_A(S), the correct fix is to work on improving the _____ algorithm.
Match each inference failure type to its precise description as defined in Machine Learning Yearning.
Arrange the steps of the inference debugging process described in MLY in the correct logical order.
According to MLY, what is a 'scoring function problem' in the context of debugging inference systems?
Randomly choosing whether to improve the search algorithm or scoring function is an acceptable debugging strategy when an inference system produces wrong outputs.
When the scoring function is identified as the inference failure cause, you should work on the _____ algorithm that estimates Score_A(S).
Match each identified inference failure cause to the correct recommended action from Machine Learning Yearning.
Arrange the reasoning steps used to determine whether an inference failure is caused by an inaccurate scoring function.
Contrast the target actions required when debugging an inference system's search algorithm versus its scoring function.
Diagnosing incorrect transcriptions in a speech recognition inference system
Consequence of randomly choosing between improving search or scoring during inference debugging