INDEX
Explanations
dates related to historical events, specifically focusing on September 11th
mentions of specific dates, particularly in September
New Auto-Interp
Negative Logits
hepat
-0.66
shr
-0.64
ying
-0.62
kus
-0.61
Interstitial
-0.61
transform
-0.60
bleach
-0.60
shake
-0.59
UGH
-0.59
jri
-0.59
POSITIVE LOGITS
2018
0.92
29
0.88
Ago
0.86
27
0.85
11
0.85
eteenth
0.83
1862
0.82
26
0.82
28
0.80
2013
0.80
Activations Density 0.022%