INDEX
Explanations
news headlines containing specific dates or numbers
topics related to health and wellness
New Auto-Interp
Negative Logits
armour
-0.90
scrap
-0.78
cones
-0.77
colours
-0.75
pubs
-0.73
organised
-0.72
isot
-0.72
licences
-0.72
cone
-0.72
dra
-0.71
POSITIVE LOGITS
Enlarge
1.49
NPR
1.10
Tweet
0.92
ccording
0.92
WASHINGTON
0.91
POLITICO
0.89
Posted
0.88
ANK
0.84
toggle
0.83
********************************
0.83
Activations Density 0.132%