INDEX
Explanations
phrases related to instructions or directives
phrases emphasizing the concept of "keeping" or maintaining something
New Auto-Interp
Negative Logits
ĪĴ
-0.77
bernatorial
-0.74
ALLY
-0.74
haps
-0.70
DIT
-0.69
BAT
-0.69
esar
-0.66
KA
-0.65
PRESIDENT
-0.65
JA
-0.65
POSITIVE LOGITS
afloat
1.06
footing
0.84
alive
0.78
wraps
0.77
composure
0.74
readable
0.72
lid
0.70
momentum
0.70
tabs
0.69
reins
0.69
Activations Density 0.108%