INDEX
Explanations
domain endings and punctuation
New Auto-Interp
Negative Logits
См
0.42
CHREIB
0.40
ُونَ
0.38
鮋
0.38
ಪ
0.37
。《
0.37
Ⲛ
0.37
selfishness
0.36
क्लेव
0.36
}$&$-
0.35
POSITIVE LOGITS
,
0.55
/
0.45
',
0.44
’,
0.43
",
0.40
”,
0.40
Berg
0.40
Allen
0.39
;
0.38
glacial
0.37
Activations Density 0.000%