INDEX
Explanations
words related to social embarrassment or shame
instances of the substring "Emb" followed by various suffixes
New Auto-Interp
Negative Logits
wagen
-0.84
ãĥīãĥ©
-0.76
creen
-0.74
gers
-0.69
obsc
-0.67
è£ı
-0.66
ãĥĥãĥĪ
-0.66
heast
-0.63
å§«
-0.63
culosis
-0.63
POSITIVE LOGITS
arrass
1.45
edded
1.31
odied
1.27
assies
1.16
attled
1.09
argo
1.07
assy
1.05
odies
1.05
edd
1.00
olicy
0.94
Activations Density 0.059%