INDEX
Explanations
languages and cultural references
New Auto-Interp
Negative Logits
ividual
-0.87
ertodd
-0.84
vre
-0.82
igham
-0.82
urion
-0.81
olicy
-0.79
ndra
-0.76
anmar
-0.73
hardt
-0.73
idth
-0.70
POSITIVE LOGITS
translation
1.36
language
1.18
translations
1.16
subtitles
1.15
pronunciation
1.12
language
1.09
Language
1.04
speaking
1.03
diction
1.03
languages
1.01
Activations Density 0.130%