INDEX
Explanations
the word "scapegoat" and variations of it
terms related to blame and responsibility, particularly in the context of scapegoating
New Auto-Interp
Negative Logits
Dragonbound
-0.82
Annotations
-0.75
nutshell
-0.69
é¾įå¥ij士
-0.66
Franch
-0.64
complicity
-0.63
nexus
-0.63
Improvement
-0.62
ACTION
-0.62
Origin
-0.61
POSITIVE LOGITS
eling
1.12
·
1.04
ards
1.02
»
0.99
¨
0.98
arding
0.97
eled
0.97
tted
0.94
mented
0.90
ħ
0.90
Activations Density 0.055%