Explanations
words related to assigning responsibility or blame
phrases related to accountability and blame in various contexts
Negative Logits
idav          -0.66
ynchronous    -0.66
ubs           -0.63
hemer         -0.63
fol           -0.62
earchers      -0.62
dayName       -0.61
gur           -0.61
ude           -0.60
liga          -0.60
Positive Logits
shaping       0.83
deaths        0.80
orship        0.80
influencing   0.69
nton          0.67
perpet        0.65
lives         0.65
steering      0.64
revolutions   0.64
Lives         0.63
Activations Density: 0.564%