INDEX
Explanations
words ending with "-ers"
terms that refer to groups of people or classifications
New Auto-Interp
Negative Logits
ADRA
-0.72
SIGN
-0.71
forfeiture
-0.69
URRENT
-0.65
SPONSORED
-0.62
ournal
-0.62
PUT
-0.62
WR
-0.61
ISON
-0.61
conviction
-0.61
POSITIVE LOGITS
mith
1.22
pace
1.12
paces
1.10
ucker
1.06
linger
1.05
haw
1.00
cream
0.97
ystem
0.96
peed
0.95
erker
0.91
Activations Density 0.045%