INDEX
Explanations
references to "heart" and related emotional concepts
Negative Logits
emento   -0.15
ment     -0.14
ess      -0.14
diffuse  -0.14
ries     -0.14
翰       -0.14
y        -0.14
ous      -0.14
eries    -0.14
hook     -0.13
Positive Logits
strings  0.23
edly     0.20
beat     0.19
broken   0.19
rending  0.19
-shaped  0.18
wood     0.17
felt     0.17
Beat     0.17
warming  0.17
Activations Density: 0.041%