INDEX
Explanations
instances of refusal or reluctance to take action
New Auto-Interp
Negative Logits
ysi
-0.16
.scalablytyped
-0.16
ká»ĭp
-0.15
ertest
-0.14
hlen
-0.14
réuss
-0.14
ëĵĿ
-0.14
.pix
-0.14
xbf
-0.14
crm
-0.14
POSITIVE LOGITS
let
0.26
accept
0.26
cooperate
0.26
allow
0.26
entertain
0.26
accepts
0.24
accepting
0.23
part
0.23
cooperation
0.23
entertaining
0.23
Activations Density 0.182%