INDEX
Explanations
instances where permission is discussed or sought
references to the concept of permission
New Auto-Interp
Negative Logits
ties
-0.71
enegger
-0.65
wed
-0.64
addle
-0.64
-0.63
vae
-0.62
nik
-0.60
ran
-0.60
ensen
-0.59
opic
-0.59
POSITIVE LOGITS
slips
0.99
Reviewer
0.91
permission
0.91
authorizing
0.89
required
0.87
slip
0.86
granted
0.81
permissions
0.79
approvals
0.79
waivers
0.78
Activations Density 0.075%