INDEX
Explanations
statements related to decision-making and commitments
New Auto-Interp
Negative Logits
الرياضيه
-0.60
Ỏ
-0.51
تضيفلها
-0.48
awtextra
-0.47
byes
-0.45
OGND
-0.45
derd
-0.45
eezy
-0.44
bilir
-0.43
autorytatywna
-0.43
POSITIVE LOGITS
awaiter
0.65
prochain
0.59
judiciales
0.59
umge
0.58
Carreira
0.58
culturelles
0.58
judicia
0.57
fiscales
0.57
prochaine
0.56
|}\
0.56
Activations Density 0.326%