Explanations
numeric symbols and special characters in the text
Negative Logits
FRING      -0.17
bras       -0.15
arias      -0.15
ativos     -0.15
plusplus   -0.15
arding     -0.14
ativo      -0.14
issors     -0.14
ativa      -0.14
Orth       -0.14
Positive Logits
zi         0.18
ETA        0.15
zung       0.15
IMA        0.15
ETS        0.14
Freed      0.14
trys       0.14
spir       0.14
208        0.13
accord     0.13
Activations Density: 0.042%