INDEX
Explanations
adjectives related to morality and judgment
terms related to social and ethical judgments about actions and practices
Adjectives related to morality and judgment
Explanation Uploaded by User
New Auto-Interp
Negative Logits
onut
-0.69
ahs
-0.60
zar
-0.59
agine
-0.58
andon
-0.58
aper
-0.57
pesky
-0.57
guyen
-0.56
Boot
-0.56
rollers
-0.56
POSITIVE LOGITS
enough
1.11
enough
0.83
Enough
0.77
against
0.75
ãĥ¼ãĥĨ
0.74
izable
0.74
nell
0.74
isable
0.72
compared
0.71
ãĥ¼ãĤ¯
0.71
Activations Density 0.289%