INDEX
Explanations
phrases related to actions taken without authorization or consent
instances of unauthorized actions or lack of consent
Negative Logits
NetMessage    -0.73
esa           -0.73
ãĥĺ           -0.70
âĢº           -0.68
culture       -0.67
cro           -0.67
cephal        -0.66
phis          -0.66
ahime         -0.65
emb           -0.64
Positive Logits
permission      1.26
authorization   1.25
consent         1.18
informing       1.08
explicit        1.01
approval        0.98
notification    0.97
Consent         0.95
parental        0.95
notice          0.92
Activations Density 0.112%
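
For readers unfamiliar with the density figure, the sketch below shows one common way such a number is computed: the fraction of sampled tokens on which the feature activates above a threshold. This is an assumption about the definition, not something the page itself states. The function name `activation_density`, the zero threshold, and the sample data are all hypothetical and chosen only to illustrate the arithmetic.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of tokens whose activation exceeds `threshold`.

    Assumed definition: density = (# tokens where the feature fires) / (# tokens).
    """
    if not activations:
        return 0.0
    fired = sum(1 for a in activations if a > threshold)
    return fired / len(activations)


# Hypothetical sample: 100,000 tokens, 112 of which activate the feature.
sample = [0.0] * 99_888 + [1.5] * 112
print(f"{activation_density(sample):.3%}")
```

Under this reading, a density of 0.112% would mean the feature fires on roughly 1 token in 900, which fits a narrowly scoped feature such as one tracking permission and consent vocabulary.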