INDEX
Explanations
statements of diagnosis, observation, and critical assessment
New Auto-Interp
Negative Logits
lein
-0.15
native
-0.15
Gross
-0.14
渡
-0.14
ä¼¼
-0.14
Leone
-0.14
swick
-0.14
arpa
-0.14
àµįà´
-0.14
ivec
-0.14
POSITIVE LOGITS
ãĤ¸ãĤ¢
0.17
seys
0.15
apore
0.15
imers
0.15
appers
0.14
zung
0.14
pora
0.14
ilden
0.14
BC
0.14
aN
0.13
Activations Density 0.227%