Short Answer

Defining Inputs and Outputs in Captioning

Question: In an end-to-end image captioning neural network, what do the variables x and y represent respectively?

Sample answer: The variable x represents the input image, and the variable y represents the caption that the network outputs directly.

Key points:

  • x is the input image.
  • y is the output caption.

Rubric: The answer is correct if it assigns the image to variable x and the caption to variable y.
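
The roles of x and y can be illustrated with a minimal sketch. Everything named here is hypothetical and chosen only for illustration: the `Image` and `Caption` type aliases, the `toy_captioner` function, and its brightness rule. A real end-to-end captioning network would learn the mapping from image to caption from data; this stand-in only shows which side of the mapping each variable sits on.

```python
from typing import List

# Hypothetical types for illustration only:
# an image is a grid of pixel intensities in [0, 1], a caption is a string.
Image = List[List[float]]
Caption = str


def toy_captioner(x: Image) -> Caption:
    """Stand-in for a trained network: maps the image x directly to a caption y.

    The brightness threshold is a made-up rule, not a learned model.
    """
    pixels = [p for row in x for p in row]
    mean_intensity = sum(pixels) / len(pixels)
    return "a bright image" if mean_intensity > 0.5 else "a dark image"


# x is the input image.
x: Image = [[0.9, 0.8], [0.7, 1.0]]

# y is the output caption, produced in one step with no intermediate stages.
y: Caption = toy_captioner(x)
print(y)
```

The point of the sketch is the signature: a single function takes x (the image) and returns y (the caption), with no hand-designed intermediate pipeline between them.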

Updated 2026-05-27

Tags

Python Programming Language

Data Science

Machine Learning

Deep Learning

Supervised Learning

Dive into Deep Learning @ D2L

Machine Learning Strategy

Machine Learning Yearning @ DeepLearning.AI