INDEX
Explanations
phrases indicating willingness or readiness to engage in actions or commitments
New Auto-Interp
Negative Logits
onda
-0.18
omaly
-0.16
abus
-0.16
CRET
-0.16
uppe
-0.15
abilidad
-0.15
AssemblyCopyright
-0.15
Ability
-0.15
raq
-0.14
ability
-0.14
POSITIVE LOGITS
accepting
0.27
accept
0.25
accepts
0.24
accept
0.21
Accept
0.20
æİ¥åıĹ
0.20
Accept
0.19
admitting
0.19
let
0.18
admit
0.18
Activations Density 0.127%