INDEX
Explanations
occurrences of punctuation marks, particularly commas
New Auto-Interp
Negative Logits
ſche
-0.63
plufieurs
-0.57
pinulongan
-0.56
ainfi
-0.55
tartalomajánló
-0.54
ſch
-0.54
deſt
-0.53
auffi
-0.51
Réponses
-0.49
raiſ
-0.49
POSITIVE LOGITS
nahilalakip
0.52
on
0.49
at
0.47
with
0.46
by
0.45
Савезне
0.44
in
0.44
ably
0.42
onto
0.42
awtextra
0.42
Activations Density 0.009%