INDEX
Explanations
proper names
mentions of specific names and individuals
New Auto-Interp
Negative Logits
undown
-0.70
eller
-0.69
ĪĴ
-0.69
oxide
-0.66
fml
-0.66
yip
-0.64
carriers
-0.63
erving
-0.63
reversible
-0.62
imilar
-0.61
POSITIVE LOGITS
sson
0.69
SAY
0.65
Cout
0.65
wine
0.64
Picks
0.64
Misty
0.64
Gets
0.64
null
0.63
Mata
0.62
Rid
0.62
Activations Density 0.095%