INDEX
Explanations
dates and locations from news articles
dates and geographic locations
New Auto-Interp
Negative Logits
berries
-0.65
hiber
-0.61
typo
-0.60
impulse
-0.59
tunes
-0.59
dotted
-0.59
bench
-0.58
maiden
-0.58
fools
-0.58
sucker
-0.57
POSITIVE LOGITS
REUTERS
1.02
Reuters
0.72
UTERS
0.69
ASS
0.66
Mandatory
0.65
Picture
0.65
Sponsor
0.63
Actor
0.62
AFP
0.61
Reuters
0.61
Activations Density 0.165%