INDEX
Explanations
mentions of the word "Channel"
mentions of specific television channels and news programs
mentions of television channels
New Auto-Interp
Negative Logits
earable
-0.70
olding
-0.70
arcity
-0.67
ever
-0.67
sund
-0.64
Rao
-0.64
Oilers
-0.64
sson
-0.64
nexpected
-0.62
ĻĤ
-0.61
POSITIVE LOGITS
Channel
1.13
Channel
1.11
channel
0.88
Divinity
0.86
Mask
0.86
Tunnel
0.83
Islands
0.82
annels
0.79
channel
0.78
icut
0.77
Activations Density 0.008%