Explanations
Words related to dates and months, particularly mentions of specific days of the month.
Negative Logits
éĹĺ          -0.72
culosis      -0.68
ngth         -0.67
FUL          -0.64
unfor        -0.60
RIS          -0.58
Sunder       -0.58
fore         -0.57
witnessing   -0.57
segreg       -0.56
Positive Logits
ibel         1.02
ried         0.97
iors         0.95
aqu          0.92
onna         0.90
imal         0.90
arate        0.88
ice          0.86
ital         0.84
av           0.83
Activations Density: 0.431%