INDEX
Explanations
phrases indicating possession or self-reference
Negative Logits
MLLoader      -1.21
Datuak        -1.07
^(@)          -1.03
NUMX          -0.97
насељу        -0.96
kháu          -0.95
ſind          -0.94
клопе         -0.91
"..\..\       -0.91
Хьажоргаш     -0.91
Positive Logits
.             0.66
'             0.61
,             0.61
’             0.60
↵             0.55
              0.55
bu            0.53
and           0.53
The           0.53
-             0.52
Activations Density: 1.564%