Explanations
negative qualifiers and expressions of capability or existence
Negative Logits
Token   Logit
ambda   -0.16
opot    -0.16
gear    -0.15
gear    -0.15
uo      -0.15
arel    -0.14
icz     -0.14
hue     -0.14
mmo     -0.14
owell   -0.14
Positive Logits
Token      Logit
идент      0.16
Stevenson  0.15
argin      0.15
vat        0.15
aky        0.15
CFG        0.14
yal        0.14
Variant    0.14
ya         0.14
秋         0.14
Activation Density: 0.358%