INDEX
Explanations
expressions related to deserving and entitlement
New Auto-Interp
Negative Logits
essler
-0.19
orton
-0.15
amera
-0.15
znam
-0.15
proh
-0.15
lain
-0.14
edic
-0.14
DataProvider
-0.14
ode
-0.14
xAF
-0.14
POSITIVE LOGITS
deserve
0.22
consideration
0.20
ably
0.20
Pun
0.19
deserves
0.19
ÑģÑĤаÑĢи
0.16
credit
0.16
better
0.15
recognition
0.15
better
0.15
Activations Density 0.028%