INDEX
Explanations
proper nouns related to news or media
mentions of the letter 'M'
Negative Logits
Wolves        -0.69
thumbs        -0.67
pony          -0.67
Mane          -0.66
firsthand     -0.65
exacerbated   -0.64
hottest       -0.64
sto           -0.63
Bil           -0.63
undercut      -0.63
Positive Logits
ountain       1.39
umbai         1.33
useum         1.32
unicip        1.30
ikhail        1.29
arijuana      1.24
ovies         1.23
eredith       1.22
igration      1.21
ISSION        1.21
Activations Density: 0.041%