INDEX
Explanations
numerical data and specific measurements related to weight, dimensions, or statistical changes
New Auto-Interp
Negative Logits
doz
-0.18
ninger
-0.16
ëĭ¤
-0.15
agli
-0.15
oser
-0.14
inz
-0.14
Ø·Ùģ
-0.14
rop
-0.14
SB
-0.13
اÙĨات
-0.13
POSITIVE LOGITS
)
0.33
]
0.25
}
0.24
à¥Ģ)
0.21
")
0.20
)
0.20
”)
0.19
à¹Į)
0.19
ा)
0.18
”
0.18
Activations Density 0.187%