Explanations
statements about personal consistency and identity
Negative Logits
eus      -0.18
ussen    -0.15
aktu     -0.15
δέ       -0.14
aget     -0.14
e        -0.14
iesel    -0.14
yles     -0.14
soon     -0.14
olla     -0.13
Positive Logits
urus           0.15
ORA            0.14
είο            0.14
ozor           0.14
šlo            0.14
طف             0.13
ありがとう     0.13
Jaw            0.13
ynchronous     0.13
ysi            0.13
Activations Density 0.092%