Case Study

Diagnose search suboptimality in a scored speech recognition system using approximate search.

Case context: You build a speech recognition system that inputs audio clip A and computes ScoreA(S) = P(S|A). Because exhaustively enumerating all (50,000)^N sentences of length N is impossible, you implement an approximate search algorithm to find the arg max of ScoreA(S). During an evaluation, the system outputs transcription S_approx. However, a manual check reveals that a different English transcription, S_better, has a higher computed score (ScoreA(S_better) > ScoreA(S_approx)).

Question: Diagnose why the system returned S_approx instead of S_better. What does this reveal about the approximate search algorithm's guarantees?

Sample answer: The system returned S_approx instead of S_better because the approximate search algorithm failed to find the sentence that maximizes ScoreA(S). This reveals that the approximate search algorithm is not guaranteed to find the true score-maximizing output, even when a higher-scoring output exists in the search space.

Key points:

  • The approximate search algorithm failed to find the score-maximizing transcription.
  • Approximate search algorithms are not guaranteed to find the output that maximizes ScoreA(S).
  • The higher-scoring transcription S_better was missed due to search approximation rather than scoring error.

Rubric: The response must explain that: 1. The approximate search algorithm failed to identify the optimal output S_better despite it having a higher score. 2. This behavior is due to the inherent limitation of approximate search algorithms, which do not guarantee finding the absolute maximum of the score function.

0

1

Updated 2026-05-26

Contributors are:

Who are from:

Tags

Data Science

Machine Learning

Deep Learning

Supervised Learning

Dive into Deep Learning @ D2L

Machine Learning Strategy

Related