INDEX
Explanations
references to specific news organizations
mentions of specific media organizations and personalities
New Auto-Interp
Negative Logits
chio
-0.71
palms
-0.70
succession
-0.66
duty
-0.66
ramid
-0.64
ounter
-0.64
creen
-0.63
orney
-0.63
complexes
-0.62
ABE
-0.60
POSITIVE LOGITS
OPLE
0.82
Verge
0.78
erd
0.75
Spread
0.74
auld
0.73
Thiel
0.73
llor
0.72
âĸ¬
0.70
ly
0.69
ously
0.68
Activations Density 0.071%