INDEX
Explanations
names of individuals
names, particularly those of individuals involved in creative or public roles
New Auto-Interp
Negative Logits
mented
-0.80
dies
-0.76
grounds
-0.75
aves
-0.74
enced
-0.74
lled
-0.73
ague
-0.71
scape
-0.70
cision
-0.69
ez
-0.68
POSITIVE LOGITS
arios
0.92
aceous
0.87
istan
0.84
ãĥĦ
0.81
Span
0.76
Polo
0.74
Roose
0.71
omach
0.70
aido
0.70
inki
0.69
Activations Density 0.035%