Explanations
phrases related to discouraging or dissuading others from certain actions
terms related to discouragement or dissuasion
Negative Logits
ammy        -0.88
iop         -0.77
odore       -0.72
uben        -0.71
Nanto       -0.69
esome       -0.68
akening     -0.68
ophon       -0.68
lene        -0.68
thren       -0.68
Positive Logits
ministic      0.96
discour       0.91
discouraged   0.86
discouraging  0.86
discourage    0.83
dissu         0.82
GGGGGGGG      0.74
vacc          0.71
minist        0.71
unwanted      0.71
Activations Density: 0.017%