INDEX
Explanations
references to days of the week and their associated events or updates
New Auto-Interp
Negative Logits
they
-0.54
elsewhere
-0.52
and
-0.50
They
-0.49
k
-0.47
somebody
-0.46
kív
-0.46
in
-0.45
,
-0.45
They
-0.43
POSITIVE LOGITS
dagens
1.11
fevere
1.05
greateſt
1.03
ſelf
1.02
ſelves
1.02
Jefus
1.00
poffe
0.99
](#
0.97
ſtate
0.96
$_"
0.95
Activations Density 0.128%