INDEX
Explanations
phrases related to people and places, especially names
proper nouns, particularly names associated with specific individuals or locations
Negative Logits
hardened      -0.70
psychic       -0.69
Gemini        -0.66
separation    -0.65
ACTED         -0.65
Chr           -0.64
mort          -0.62
respectful    -0.62
psy           -0.58
fateful       -0.58
Positive Logits
aii           1.40
haw           1.40
keye          1.08
inson         1.04
intosh        0.98
daq           0.97
vine          0.96
awks          0.95
kins          0.93
igans         0.91
Activations Density: 0.006%