INDEX
Explanations
mentions of the media organization "ABC"
mentions of the ABC network
New Auto-Interp
Negative Logits
irts
-0.77
wagen
-0.72
amoto
-0.71
Hiroshima
-0.70
boarding
-0.69
vik
-0.68
agues
-0.67
ainer
-0.65
crop
-0.65
umatic
-0.65
POSITIVE LOGITS
DEF
1.37
NEWS
0.87
IRO
0.85
ABC
0.81
nect
0.77
Mub
0.76
IS
0.76
DF
0.75
ABC
0.75
ERG
0.72
Activations Density 0.005%