INDEX
Explanations
expressions of desire or intent
New Auto-Interp
Negative Logits
tein
-0.80
guiActiveUn
-0.75
poons
-0.71
trump
-0.67
_-_
-0.66
akedown
-0.66
agram
-0.65
�
-0.65
ILCS
-0.64
quickShipAvailable
-0.64
POSITIVE LOGITS
consistency
0.80
satisfactory
0.78
improved
0.72
cellent
0.72
stronger
0.70
consistent
0.69
better
0.69
fresh
0.69
itiveness
0.68
simpler
0.67
Activations Density 0.053%