INDEX
Explanations
phrases expressing strong emotions, particularly negative ones like shame, disgust, and anger
words related to condemnation or criticism
Negative Logits
lumber         -0.72
brass          -0.69
Tanz           -0.69
complement     -0.68
fuller         -0.66
stocked        -0.64
cart           -0.64
stocking       -0.62
vantage        -0.62
complementary  -0.62
Positive Logits
ful      1.27
fully    1.26
fulness  1.18
FUL      1.17
ingly    1.05
ously    1.00
ous      0.99
bringer  0.96
mong     0.94
lessly   0.93
Activations Density: 0.223%