INDEX
Explanations
phrases related to promoting or advertising something
New Auto-Interp
Negative Logits
quartered
-0.76
sbm
-0.69
*/(
-0.68
ãĤµãĥ¼ãĥĨãĤ£ãĥ¯ãĥ³
-0.68
assian
-0.66
fuck
-0.66
falls
-0.66
Detected
-0.66
asso
-0.65
kes
-0.65
POSITIVE LOGITS
awareness
0.88
abstinence
0.83
andise
0.81
entious
0.80
entrepreneurship
0.80
equality
0.79
democracy
0.79
intolerance
0.78
excellence
0.78
ably
0.77
Activations Density 0.070%