INDEX
Explanations
punctuation and formatting elements in the text
New Auto-Interp
Negative Logits
hdessä
-0.47
læ
-0.46
sondern
-0.44
fohl
-0.44
læ
-0.43
LayoutConstraint
-0.41
Successfully
-0.41
acyjny
-0.41
olella
-0.41
akujem
-0.41
POSITIVE LOGITS
tamen
1.21
nevertheless
1.15
nonetheless
1.12
yet
1.10
Pourtant
1.03
still
1.03
Nonetheless
1.01
pourtant
1.01
Nonetheless
1.01
Nevertheless
0.96
Activations Density 0.108%