INDEX
Explanations
concepts related to accountability and responsibility dynamics in various contexts.
causes negative outcomes
New Auto-Interp
Negative Logits
utilizza
0.28
contains
0.27
contains
0.27
utilising
0.27
utilises
0.27
använder
0.27
utilizes
0.26
utilisent
0.26
utilizzando
0.26
using
0.26
POSITIVE LOGITS
導致
0.39
causar
0.36
resentment
0.34
demoral
0.33
导致
0.32
ocasion
0.32
unwillingness
0.31
injustice
0.31
feelings
0.31
reluctance
0.31
Activations Density 0.646%