INDEX
Explanations
specific numerical values or measurements, particularly in scientific contexts
New Auto-Interp
Negative Logits
propOrder
-0.94
للمعارف
-0.94
ⓧ
-0.91
تقاوى
-0.91
समीक्षक
-0.90
LookAnd
-0.88
незавершена
-0.87
متعلقه
-0.85
tagext
-0.81
שוליים
-0.80
POSITIVE LOGITS
5
0.51
4
0.48
6
0.46
two
0.46
8
0.45
teen
0.44
two
0.43
inneren
0.43
heavy
0.41
9
0.41
Activations Density 0.535%