INDEX
Explanations
references to television news channels
references to television networks or broadcasts
Negative Logits
Token             Logit
ussen             -0.73
0000000000000000  -0.73
hammad            -0.69
hend              -0.69
cific             -0.68
leans             -0.68
tha               -0.67
strument          -0.66
otherwise         -0.66
notations         -0.65
Positive Logits
Token             Logit
ILLE              1.26
Everywhere        0.88
trop              0.86
NZ                0.84
Broadcasting      0.83
iolet             0.83
OS                0.80
NG                0.80
ISION             0.77
PG                0.74
Activations Density: 0.021%