INDEX
Explanations
dates, specifically those related to the attacks on September 11
mentions of specific dates, particularly in September
New Auto-Interp
Negative Logits
holder
-0.76
Reviewer
-0.75
holders
-0.74
Interstitial
-0.72
stricken
-0.71
Hots
-0.64
Bubble
-0.63
crown
-0.61
ball
-0.61
jriwal
-0.61
POSITIVE LOGITS
alez
1.08
ibel
0.87
isco
0.86
hew
0.85
opus
0.84
gins
0.83
hens
0.83
ober
0.83
oss
0.82
olitics
0.82
Activations Density 0.005%