INDEX
Explanations
questions about possibility and permission
New Auto-Interp
Negative Logits
aye
-0.16
Raq
-0.16
lse
-0.15
pest
-0.15
agate
-0.14
antine
-0.14
osoph
-0.14
fel
-0.14
ucene
-0.14
iano
-0.13
POSITIVE LOGITS
Truthy
0.17
Kirk
0.15
Pack
0.15
ãĥ¥
0.14
inx
0.14
Fr
0.14
Broadway
0.14
à¥Į
0.14
CEPTION
0.13
atif
0.13
Activations Density 0.088%