Explanations
occurrences of years in the text
Negative Logits
ambi       -0.16
linger     -0.16
rror       -0.15
stock      -0.15
iros       -0.14
malink     -0.14
anı        -0.13
correct    -0.13
ningar     -0.13
elix       -0.13
Positive Logits
rome       0.15
?action    0.15
ãĥªãĤ¹     0.15
idos       0.14
sov        0.14
ispers     0.14
CharArray  0.14
Gib        0.14
ILLISE     0.14
iswa       0.13
Activations Density: 0.044%