INDEX
Explanations
instances where someone gives permission or authorization
phrases indicating permission or consent
New Auto-Interp
Negative Logits
Eater
-0.51
sear
-0.50
Fram
-0.50
\":
-0.50
ï¸ı
-0.49
Nielsen
-0.46
yssey
-0.46
Journal
-0.45
Serial
-0.45
ceans
-0.44
POSITIVE LOGITS
subordinate
0.68
sacrific
0.65
freely
0.64
uninterrupted
0.63
unim
0.62
withdrawn
0.61
subord
0.60
ardless
0.58
unrestricted
0.57
uncond
0.56
Activations Density 0.951%