INDEX
Explanations
news and information-related terms or topics
references to news stories and their significance
New Auto-Interp
Negative Logits
ependence
-0.80
emale
-0.78
gger
-0.77
gee
-0.77
acquaintance
-0.76
othal
-0.74
ajor
-0.74
alus
-0.73
ggie
-0.73
cki
-0.73
POSITIVE LOGITS
Article
0.93
trending
0.87
Content
0.86
article
0.85
Interstitial
0.78
headlines
0.78
alerts
0.75
Trend
0.74
Stories
0.74
Torrent
0.73
Activations Density 0.028%