Explanations
adjectives describing something positively
the word "nice" and its variations or contexts that imply positivity
Negative Logits
Token         Logit
ogens         -0.74
onics         -0.73
rules         -0.72
uilding       -0.71
orders        -0.70
authorized    -0.70
ationally     -0.69
reports       -0.68
ivals         -0.66
bin           -0.66
Positive Logits
Token         Logit
touch         0.99
touches       0.97
sounding      0.89
neat          0.88
little        0.87
nice          0.83
gesture       0.83
smelling      0.82
warm          0.82
fluffy        0.80
Activations Density: 0.055%
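
As a rough illustration of what a density figure like this could mean, the sketch below assumes that activation density is the fraction of token positions on which the feature's activation is above zero. That definition, the function name `activation_density`, the `threshold` parameter, and the sample data are all assumptions made for illustration; they are not taken from this page or from any particular tool.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds threshold.

    Assumption: "density" means the share of positions where the feature fires.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical data: the feature fires on 11 of 20,000 token positions.
sample = [0.0] * 19989 + [1.5] * 11
density = activation_density(sample)
print(f"{density:.3%}")
```

Under this reading, a density of 0.055% would correspond to the feature firing on roughly 5 or 6 tokens in every 10,000, which fits a feature tied to a narrow set of positive descriptive words.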