INDEX
Explanations
mentions of a specific phrase or brand name, namely "Legend of Korra"
references to the "Legend of Korra" franchise
New Auto-Interp
Negative Logits
ted
-0.72
reon
-0.67
ting
-0.67
ters
-0.67
ffee
-0.67
ushes
-0.66
ushing
-0.65
artney
-0.65
midday
-0.63
station
-0.63
POSITIVE LOGITS
Legend
1.14
Legend
1.11
uin
0.88
naire
0.88
Doc
0.72
Rank
0.70
Maker
0.69
erer
0.67
Tales
0.66
Contin
0.66
Activations Density 0.007%