INDEX
Explanations
names related to influential individuals
proper names, particularly those of individuals associated with specific actions or contexts
New Auto-Interp
Negative Logits
éĹĺ
-0.98
izations
-0.82
heast
-0.82
ufact
-0.78
xual
-0.77
psons
-0.77
arily
-0.74
coup
-0.74
redits
-0.74
mingham
-0.72
POSITIVE LOGITS
Hub
0.87
iasis
0.77
Malfoy
0.71
asar
0.69
Feinstein
0.68
adium
0.68
aho
0.67
glare
0.67
Wan
0.67
Sheldon
0.67
Activations Density 0.035%