INDEX
Explanations
adjectives describing something positively or favorably
the recurring use of the word "pretty" to indicate a positive sentiment or evaluation
Negative Logits
venant      -0.85
pent        -0.77
arta        -0.72
upon        -0.71
FTA         -0.69
ossession   -0.69
APD         -0.69
allas       -0.68
iments      -0.68
herry       -0.67
Positive Logits
darn        1.30
nifty       0.98
damn        0.91
much        0.88
harmless    0.87
neat        0.83
tasty       0.83
damned      0.83
handy       0.82
cool        0.81
Activations Density: 0.018%