Explanations
words related to strong emotional reactions such as tears
references to emotional expressions, particularly tears
Negative Logits
Token      Logit
stood      -0.70
ancial     -0.69
offic      -0.67
ENCY       -0.64
Trend      -0.64
Senegal    -0.62
Ü          -0.61
Notable    -0.61
enture     -0.60
pmwiki     -0.60
Positive Logits
Token      Logit
tears      0.99
bows       0.98
stals      0.84
Tears      0.84
stained    0.83
terday     0.82
fters      0.79
mith       0.77
bow        0.75
beads      0.75
Activations Density: 0.010%