Explanations
words related to the concept of love
expressions of love and affection
Negative Logits
SPONSORED     -0.85
HEAD          -0.73
Dull          -0.69
Restrict      -0.69
Skydragon     -0.67
Brookings     -0.66
METHOD        -0.66
Christensen   -0.66
restricted    -0.66
RD            -0.65
Positive Logits
love      3.60
LOVE      2.85
love      2.65
Love      2.30
loves     2.25
Love      2.17
loved     2.07
loving    2.04
adore     2.02
passion   1.72
Activations Density 0.026%