INDEX
Explanations
the word "penchant"
expressions of strong preferences or tendencies
New Auto-Interp
Negative Logits
izoph
-0.72
arya
-0.68
Died
-0.64
paran
-0.64
Shelter
-0.64
agements
-0.64
sshd
-0.63
eared
-0.61
omaly
-0.61
ÃŃs
-0.61
POSITIVE LOGITS
Chip
0.68
gie
0.68
maps
0.67
Portland
0.66
hots
0.65
cape
0.65
fortunes
0.65
Corn
0.64
Cotton
0.64
STEM
0.63
Activations Density 0.043%