Explanations
the format of a command or instruction
themes related to societal rules and critiques of authority
NEGATIVE LOGITS
caveats                    -0.60
ortium                     -0.59
caveat                     -0.56
rede                       -0.53
BuyableInstoreAndOnline    -0.52
unanimous                  -0.51
peaked                     -0.50
reiterate                  -0.50
notable                    -0.50
unden                      -0.49
POSITIVE LOGITS
.[                         0.95
.</                        0.94
.?                         0.92
.",                        0.92
.''.                       0.90
.ãĢį                       0.89
.                          0.88
.''                        0.87
.;                         0.86
.","                       0.86
Activations Density: 0.855%