INDEX
Explanations
dates, specifically related to the month of October
New Auto-Interp
Negative Logits
aug
-0.16
lement
-0.16
WithOptions
-0.16
Apr
-0.16
Sommer
-0.16
Feb
-0.15
lems
-0.15
ledon
-0.15
Easter
-0.15
ingleton
-0.15
POSITIVE LOGITS
-Nov
0.28
fest
0.27
tober
0.25
/oct
0.24
ober
0.23
份
0.23
Surprise
0.22
ubre
0.22
avia
0.20
oct
0.20
Activations Density 0.021%