INDEX
Explanations
adjectives related to softness
repeated mentions of the word "soft."
New Auto-Interp
Negative Logits
ulhu
-0.78
reon
-0.78
USS
-0.74
ITNESS
-0.70
Chronicles
-0.69
Ancients
-0.69
McKenna
-0.68
Emir
-0.68
Pax
-0.67
ICAN
-0.67
POSITIVE LOGITS
ball
1.13
ening
1.10
hearted
1.09
palate
1.07
ener
1.02
balls
0.93
cover
0.89
est
0.87
ened
0.85
heart
0.83
Activations Density 0.013%