INDEX
Explanations
phrases related to consent and obligation
New Auto-Interp
Negative Logits
еви
-0.15
ç°
-0.15
oret
-0.14
ctxt
-0.14
respons
-0.13
åħģ
-0.13
ToLeft
-0.13
aret
-0.13
олее
-0.13
PubMed
-0.13
POSITIVE LOGITS
somehow
0.25
indirect
0.22
portions
0.19
indirectly
0.19
hoped
0.18
equivalent
0.18
otherwise
0.18
wished
0.18
partly
0.17
sorts
0.17
Activations Density 0.023%