Explanations
names related to sports or individuals
proper nouns and names associated with influential people and entities
Negative Logits
Token       Logit
HI          -0.68
Wonderland  -0.64
mouse       -0.64
pasture     -0.63
gestation   -0.63
unden       -0.63
Race        -0.62
psychiat    -0.62
Shell       -0.60
Dele        -0.59
Positive Logits
Token       Logit
etus        0.80
Äĩ          0.80
unic        0.73
ilver       0.73
igham       0.72
jen         0.70
agi         0.68
boa         0.67
eri         0.66
udic        0.66
Activations Density: 0.505%