INDEX
Explanations
phrases indicating actions taken without consent or permission
phrases related to actions taken without consent or permission
New Auto-Interp
Negative Logits
raq
-0.79
Reincarnated
-0.78
ãĤ¼ãĤ¦ãĤ¹
-0.73
Kinnikuman
-0.69
arse
-0.69
è¦ļéĨĴ
-0.66
ãĤ¤ãĥĪ
-0.66
ä¸ī
-0.66
romy
-0.65
ãĥ³ãĤ¸
-0.64
POSITIVE LOGITS
permission
1.25
authorization
1.22
consent
1.12
exception
1.03
prompting
0.98
approval
0.97
qualification
0.96
supervision
0.96
specifying
0.94
reperc
0.94
Activations Density 0.096%