INDEX
Explanations
phrases related to holding individuals or groups accountable for their actions
instances of the word "accountable" and concepts related to accountability
Negative Logits
Token    Logit
lyn      -0.84
ker      -0.81
fen      -0.74
mare     -0.74
othe     -0.70
Fan      -0.70
nz       -0.70
fer      -0.70
ning     -0.69
xxxx     -0.69
Positive Logits
Token            Logit
accountable      1.13
rity             0.82
Accountability   0.82
adjud            0.81
dilig            0.77
accountability   0.75
institution      0.73
institutions     0.71
whistlebl        0.71
Pengu            0.70
Activations Density: 0.009%