INDEX
Explanations
names of specific individuals
names of notable individuals, particularly in a sports or political context
New Auto-Interp
Negative Logits
riors
-0.78
armac
-0.75
door
-0.73
tub
-0.73
river
-0.72
izoph
-0.72
´
-0.70
dream
-0.69
urban
-0.68
orney
-0.68
POSITIVE LOGITS
Gustav
0.82
ific
0.80
ensor
0.78
isl
0.78
IFIED
0.77
Kul
0.76
IFIC
0.75
atell
0.75
Luthor
0.75
prest
0.70
Activations Density 0.040%