INDEX
Explanations
words related to a lack of empathy or understanding towards others
terms related to insensitivity or lack of consideration
New Auto-Interp
Negative Logits
llan
-0.67
ries
-0.64
Scotia
-0.60
Chaser
-0.58
Loving
-0.56
Older
-0.56
liking
-0.56
err
-0.56
Lamar
-0.56
ãĥ£
-0.55
POSITIVE LOGITS
istent
1.32
urances
1.26
pite
1.23
ensitivity
1.21
urable
1.16
uffer
1.15
ides
1.15
ensible
1.13
ourcing
1.13
criptions
1.11
Activations Density 0.009%