INDEX
Explanations
phrases indicating the concept of permission or capability
New Auto-Interp
Negative Logits
scribed
-0.15
elves
-0.15
eme
-0.15
yt
-0.14
allis
-0.14
uddle
-0.14
erry
-0.14
emony
-0.14
uten
-0.13
dap
-0.13
POSITIVE LOGITS
us
0.20
fullscreen
0.16
sqlCommand
0.16
ances
0.14
ioned
0.14
961
0.14
orio
0.14
/disable
0.14
ance
0.14
-bodied
0.14
Activations Density 0.056%