Explanations
mentions of news media outlets and reports
Negative Logits
inho                     -0.77
Reson                    -0.69
ayne                     -0.69
pired                    -0.66
Wade                     -0.65
BuyableInstoreAndOnline  -0.63
vasive                   -0.62
ensions                  -0.62
Stark                    -0.60
Stupid                   -0.60
Positive Logits
room                     1.00
reader                   0.94
orial                    0.93
coverage                 0.91
Coverage                 0.91
headlines                0.90
eval                     0.89
Reporting                0.89
flash                    0.88
outlets                  0.86
Activations Density 3.691%