INDEX
Explanations
names or entities related to individuals, especially when combined with actions or locations
proper names, particularly of individuals
New Auto-Interp
Negative Logits
ctors
-0.73
aneously
-0.67
Presidents
-0.66
acts
-0.65
cers
-0.63
presidents
-0.63
fare
-0.63
ously
-0.63
istically
-0.62
condition
-0.62
POSITIVE LOGITS
mith
1.56
hift
1.42
peed
1.34
chool
1.32
hip
1.32
hirt
1.32
heet
1.30
pring
1.30
erver
1.30
creen
1.29
Activations Density 0.186%