Explanations
years mentioned in a chronological context
Negative Logits
Token        Logit
January      -0.22
Winter       -0.21
January      -0.20
winter       -0.20
February     -0.19
JAN          -0.19
March        -0.18
Jan          -0.18
Christmas    -0.18
Winter       -0.17
Positive Logits
Token        Logit
itler        0.17
echan        0.16
.hardware    0.15
estival      0.15
Potion       0.14
ilter        0.14
ëħĦ          0.14
BUG          0.14
]=>          0.13
ãĥ³ãĥĹ       0.13
Activations Density: 0.103%