INDEX
Explanations
proper nouns related to news events or people (e.g., names of individuals and organizations)
references to political affiliations or left/right ideological positions
New Auto-Interp
Negative Logits
sed
-0.72
enos
-0.71
glas
-0.67
general
-0.67
gress
-0.66
è¦ļéĨĴ
-0.65
apo
-0.65
acious
-0.64
iom
-0.63
ominated
-0.63
POSITIVE LOGITS
pictured
0.91
Thumbnails
0.81
terday
0.76
*/(
0.72
flanked
0.72
allery
0.72
Uriel
0.71
ngth
0.71
portraits
0.71
IMAGES
0.70
Activations Density 0.026%