INDEX
Explanations
references to academic authors and their contributions
New Auto-Interp
Negative Logits
setAll
-0.59
अलावा
-0.57
Toten
-0.56
bný
-0.54
✪
-0.52
Trost
-0.52
And
-0.51
poš
-0.51
ėl
-0.50
ربعة
-0.50
POSITIVE LOGITS
JAS
1.33
Jamb
1.26
Jot
1.24
jLabel
1.22
jc
1.19
Jop
1.17
Jit
1.17
JAS
1.17
JE
1.14
JMP
1.14
Activations Density 1.299%