INDEX
Explanations
phrases indicating uncertainty or indecisiveness
instances of negation or expressions of doubt
Negative Logits
Palest          -0.84
Interstitial    -0.76
ãĥ¯ãĥ³          -0.75
Buyable         -0.71
Desk            -0.67
è£ħ             -0.66
çͰ             -0.65
Sabha           -0.65
Kap             -0.64
ESSION          -0.64
Positive Logits
âĢķ             0.86
£               0.76
ump             0.76
¢               0.74
ĺ               0.72
¼               0.70
Ķ               0.70
¡               0.69
º               0.67
¿               0.66
Activations Density 0.329%
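
The density figure above is read here as the fraction of sampled tokens on which the feature has a nonzero activation. The page itself does not state the definition, so this is an assumption. A minimal sketch under that assumption follows; the function name `activation_density`, the zero threshold, and the sample data are all hypothetical and chosen only to reproduce a value of 0.329%.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of tokens whose activation exceeds `threshold`.

    Assumption: "density" means the share of tokens on which the feature
    fires at all; the source page does not define the term.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    fired = sum(1 for a in activations if a > threshold)
    return fired / len(activations)


# Hypothetical data: 100,000 tokens, 329 of which activate the feature.
sample = [1.5] * 329 + [0.0] * (100_000 - 329)
density = activation_density(sample)
print(f"{density:.3%}")  # prints 0.329%
```

Under this reading, a density of 0.329% means the feature fires on roughly 1 token in 300.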