INDEX
Explanations
usage of transitional phrases that indicate contrast or exceptions
New Auto-Interp
Negative Logits
zeÅĦ
-0.16
Halk
-0.14
.hw
-0.14
isay
-0.14
istrovstvÃŃ
-0.13
mî
-0.13
ocre
-0.13
ilha
-0.13
ouce
-0.13
uces
-0.13
POSITIVE LOGITS
/or
0.16
lem
0.14
âĤ¬“
0.14
.infinity
0.14
atr
0.13
езда
0.13
bordel
0.13
verts
0.13
ÅĽ
0.12
umi
0.12
Activations Density 0.177%