INDEX
Explanations
names of individuals involved in specific events or stories
New Auto-Interp
Negative Logits
isSpecialOrderable
-0.69
URA
-0.66
polar
-0.60
licens
-0.60
Sorce
-0.58
showc
-0.58
apore
-0.57
newsp
-0.57
pree
-0.56
facing
-0.55
POSITIVE LOGITS
reau
0.85
loe
0.81
Ferry
0.80
oola
0.80
illet
0.75
wana
0.74
tein
0.74
acci
0.73
Aerospace
0.71
rice
0.70
Activations Density 0.501%