INDEX
Explanations
media captions and related textual descriptions
captions in media-related content
Negative Logits
ĪĴ          -0.79
walker      -0.72
Magikarp    -0.72
nesia       -0.70
vals        -0.69
Ö¼          -0.66
bowling     -0.64
urat        -0.64
atto        -0.64
»Ĵ          -0.64
Positive Logits
WATCH               0.83
acters              0.77
Theresa             0.72
Corbyn              0.69
ITV                 0.69
BBC                 0.69
Natasha             0.67
Prof                0.67
Survive             0.66
=-=-=-=-=-=-=-=-    0.66
Activations Density 0.004%