INDEX
Explanations
names, particularly first and last names
references to specific names or terms associated with individuals
New Auto-Interp
Negative Logits
cytok
-0.69
destro
-0.68
pen
-0.67
stocking
-0.67
chim
-0.66
senal
-0.66
inval
-0.64
repe
-0.63
rers
-0.63
volt
-0.63
POSITIVE LOGITS
\\\\\\\\
0.97
uben
0.86
nesses
0.82
esley
0.79
eus
0.77
ica
0.77
heid
0.76
ITIES
0.74
orr
0.74
elfth
0.73
Activations Density 0.019%