INDEX
Explanations
adjectives or verbs related to softening
references to "soft" concepts, indicating a focus on gentle or less aggressive approaches
Negative Logits
ulhu        -0.76
reon        -0.73
agher       -0.72
Ancients    -0.71
Pax         -0.70
McKenna     -0.69
ICAN        -0.66
USS         -0.65
OUGH        -0.65
Blessed     -0.65
Positive Logits
ening       1.18
ball        1.11
ener        1.09
hearted     1.01
palate      0.99
eners       0.98
ened        0.91
cover       0.89
ens         0.88
balls       0.88
Activations Density: 0.016%
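
To give a sense of scale for the density figure, the sketch below converts a density percentage into an expected count of activating tokens. It assumes, as is common on dashboards of this kind but not stated here, that density means the fraction of sampled tokens on which the feature has a nonzero activation. The function name and the one-million-token sample size are hypothetical, chosen only for illustration.

```python
def expected_active_tokens(density_percent: float, n_tokens: int) -> float:
    """Expected number of tokens with nonzero activation.

    Assumption (not confirmed by the source): density is the fraction of
    sampled tokens on which the feature fires, expressed as a percentage.
    """
    return density_percent / 100.0 * n_tokens


if __name__ == "__main__":
    # Hypothetical sample size of one million tokens.
    count = expected_active_tokens(0.016, 1_000_000)
    print(f"about {count:.0f} activating tokens per million")
```

Under that reading, a density of 0.016% corresponds to a sparse feature that fires on only a small handful of tokens in any ordinary passage of text.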