INDEX
Explanations
words related to people's names
occurrences of the word "on."
New Auto-Interp
Negative Logits
recomp
-0.64
compe
-0.62
eas
-0.60
ITNESS
-0.59
TPS
-0.59
âĵĺ
-0.58
thumbnail
-0.58
RESULTS
-0.58
Immunity
-0.58
specificity
-0.58
POSITIVE LOGITS
nen
1.27
auts
1.15
autical
1.14
etheless
1.09
cé
1.06
nect
1.05
ews
1.00
arios
0.98
aut
0.98
etta
0.96
Activations Density 0.062%