INDEX
Explanations
phrases related to giving commands or directives
Negative Logits
ģĸ            -0.69
20439         -0.67
Combined      -0.67
ļéĨĴ          -0.64
purported     -0.61
ancest        -0.59
features      -0.58
culminating   -0.58
Vendor        -0.58
strikingly    -0.58
Positive Logits
yourselves    1.45
yourself      1.10
thy           1.00
your          0.94
ye            0.93
ya            0.92
Yourself      0.89
thou          0.88
me            0.86
fuckin        0.84
Activations Density 0.281%