INDEX
Explanations
proper nouns related to locations and names
New Auto-Interp
Negative Logits
ishers
-0.72
arty
-0.63
ered
-0.62
acts
-0.61
ship
-0.61
ries
-0.60
ously
-0.59
istically
-0.59
istic
-0.58
ghai
-0.57
POSITIVE LOGITS
peed
1.28
aurus
1.26
ync
1.25
sein
1.23
hip
1.22
ystem
1.21
mith
1.20
ource
1.20
CRIP
1.19
erver
1.18
Activations Density 1.471%