INDEX
Explanations
names of specific individuals
proper nouns, specifically names of individuals
New Auto-Interp
Negative Logits
tumblr
-0.73
Marketable
-0.71
polluted
-0.64
derailed
-0.63
coincided
-0.62
chemicals
-0.62
coincides
-0.61
sexual
-0.61
constitutes
-0.61
engulfed
-0.60
POSITIVE LOGITS
APS
0.72
Jace
0.71
anton
0.70
UA
0.68
Kis
0.67
Atkinson
0.67
Angelo
0.67
cia
0.66
NP
0.65
Arch
0.64
Activations Density 0.195%