INDEX
Explanations
names or terms related to various locations
proper nouns, specifically names of individuals and places, as well as references to entities
Negative Logits
odium        -0.77
AMS          -0.67
SAP          -0.66
GREEN        -0.63
rack         -0.61
iox          -0.61
Logged       -0.61
barr         -0.61
adel         -0.60
intermitt    -0.59
Positive Logits
kun          1.98
arest        1.71
Claus        1.65
Pluto        1.53
Satan        1.37
Dracula      1.25
Psycho       1.18
psycho       1.18
anova        1.16
caus         1.16
Activations Density 0.044%