Explanations
references to specific individuals, particularly in the context of politics or public affairs
Negative Logits
erken     -0.08
hood      -0.08
er        -0.07
SCP       -0.07
hard      -0.07
pp        -0.07
intptr    -0.07
rought    -0.07
PP        -0.07
band      -0.06
Positive Logits
gomery    0.12
agne      0.09
aine      0.09
serrat    0.08
ainer     0.08
aneous    0.08
aneously  0.08
ics       0.07
shire     0.07
iers      0.07
Activations Density: 0.025%