INDEX
Explanations
instances of tear-related words
references to tears and tear-related emotions or actions
Negative Logits
eton         -0.72
enhagen      -0.72
abilities    -0.72
ioned        -0.68
ITNESS       -0.68
zsche        -0.67
alty         -0.67
POL          -0.66
Û            -0.66
REDACTED     -0.66
Positive Logits
adoes        1.11
ful          1.09
fully        0.96
duct         0.93
bows         0.92
gas          0.89
stained      0.88
fulness      0.88
ados         0.88
adic         0.86
Activations Density 0.047%