INDEX
Explanations
dates in the format "February DD"
references to the month of February and related dates
Negative Logits
Token       Value
cumbers     -0.86
ullivan     -0.73
eteria      -0.70
ioch        -0.69
ocratic     -0.68
awaru       -0.67
ographed    -0.66
ollow       -0.65
chwitz      -0.64
alach       -0.64
Positive Logits
Token       Value
2019        1.00
2015        0.92
2017        0.90
nd          0.88
2016        0.88
2021        0.87
2018        0.85
ruary       0.84
Madness     0.82
2024        0.82
Activations Density: 0.011%