INDEX
Explanations
specific dates and numerical references in the text
New Auto-Interp
Negative Logits
adge
-0.17
-pane
-0.16
Sat
-0.15
vala
-0.15
annon
-0.15
çĵľ
-0.14
vÃŃc
-0.14
chodu
-0.14
rosse
-0.14
erro
-0.14
POSITIVE LOGITS
ok
0.43
j
0.33
fe
0.32
ok
0.32
.ok
0.30
juni
0.28
_ok
0.28
Ok
0.27
maj
0.27
okay
0.26
Activations Density 0.027%