INDEX
Explanations
mentions of political figures and organizations in news articles
New Auto-Interp
Negative Logits
©¶æ¥µ
-0.73
CONCLUS
-0.66
£ı
-0.62
Chapters
-0.61
¥µ
-0.60
phas
-0.58
TL
-0.58
Table
-0.57
trivial
-0.57
Ͻ
-0.56
POSITIVE LOGITS
Photographer
1.10
REUTERS
1.09
window
1.06
Photo
1.05
Photograph
1.03
Courtesy
1.01
PHOTO
1.00
photo
1.00
IMAGES
0.98
reenshot
0.94
Activations Density 1.757%