INDEX
Explanations
instances of dates or numerical information
New Auto-Interp
Negative Logits
January
-0.20
February
-0.20
May
-0.20
March
-0.17
August
-0.17
November
-0.17
January
-0.16
December
-0.16
olle
-0.16
chine
-0.16
POSITIVE LOGITS
NO
0.28
NO
0.27
DEC
0.22
DEC
0.21
-de
0.19
nov
0.18
ober
0.17
ONGO
0.17
DE
0.17
nov
0.17
Activations Density 0.101%