Formula
Distillation Loss for Relation-Based Knowledge
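The distillation loss for relation-based knowledge transfer, based on the relations between feature maps, is calculated as:
$$L_{RelD}(f_t, f_s) = \mathcal{L}_{R^1}\big(\Psi_t(\hat{f}_t, \check{f}_t),\ \Psi_s(\hat{f}_s, \check{f}_s)\big)$$
Where:
- $f_t$ and $f_s$ are the feature maps of the teacher and student models, respectively.
- $\hat{f}_t$ and $\check{f}_t$ are a pair of feature maps chosen from the teacher.
- $\hat{f}_s$ and $\check{f}_s$ are a pair of feature maps chosen from the student.
- $\Psi_t(\cdot)$ and $\Psi_s(\cdot)$ are the similarity functions applied to the pairs of feature maps from the teacher and student models, respectively.
- $\mathcal{L}_{R^1}(\cdot)$ is the correlation function between the teacher and student feature maps.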
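As a rough illustration of how the pieces fit together, the sketch below picks one possible similarity function (a Gram matrix between the two feature maps of a pair) and one possible correlation function (mean squared error between the teacher and student similarity matrices). Both choices, along with the names `gram_similarity` and `relation_distillation_loss` and the toy inputs, are assumptions made for this example; the definition above does not prescribe them. Each feature map is represented as a list of channels, with every channel flattened to a list of spatial activations.

```python
def gram_similarity(f_a, f_b):
    """Assumed similarity function: Gram matrix between two feature maps.

    Each feature map is a list of channels; each channel is a flat list of
    n spatial activations. Returns a (channels of f_a) x (channels of f_b)
    matrix of inner products, normalized by n.
    """
    n = len(f_a[0])
    return [[sum(x * y for x, y in zip(ch_a, ch_b)) / n for ch_b in f_b]
            for ch_a in f_a]


def relation_distillation_loss(teacher_pair, student_pair):
    """Assumed correlation function: mean squared error between the
    teacher and student similarity matrices (which must share a shape)."""
    psi_t = gram_similarity(*teacher_pair)  # relation inside the teacher
    psi_s = gram_similarity(*student_pair)  # relation inside the student
    diffs = [(t - s) ** 2
             for row_t, row_s in zip(psi_t, psi_s)
             for t, s in zip(row_t, row_s)]
    return sum(diffs) / len(diffs)


# Toy feature maps: 2 channels, 2 spatial positions each (hypothetical values).
teacher_pair = ([[1.0, 2.0], [0.0, 1.0]], [[1.0, 0.0], [2.0, 2.0]])
student_pair = ([[1.0, 0.0], [0.0, 1.0]], [[1.0, 0.0], [0.0, 2.0]])

loss = relation_distillation_loss(teacher_pair, student_pair)
print(loss)  # 2.25
```

The loss drops to zero when the student reproduces the teacher's pairwise relations exactly, even if the individual feature maps differ. With this particular choice the teacher and student pairs need matching channel counts; when they differ, a different similarity function would be required.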
Updated 2026-05-10
Tags
Deep Learning (in Machine learning)
Data Science
Computing Sciences