INDEX
Explanations
phrases that attribute a particular event or outcome to someone or something, or assign blame for it
phrases indicating attribution or blame
Negative Logits
efer          -0.88
apest         -0.86
ensable       -0.80
ŃĶ            -0.77
ourse         -0.76
ela           -0.75
utsche        -0.74
imensional    -0.73
apolis        -0.73
raf           -0.72
Positive Logits
inexper             1.13
incompetence        1.06
negligence          1.03
faulty              1.02
lack                0.92
boredom             0.91
misunderstanding    0.90
inaction            0.90
coincidence         0.89
factors             0.89
Activations Density 0.483%
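
The page does not say how the density figure is computed. A common reading, assumed here, is the fraction of token positions on which the feature is active (activation above zero). The sketch below illustrates that reading only; the function name `activation_density`, the `threshold` parameter, and the toy data are hypothetical and not taken from the source.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of positions whose activation exceeds `threshold`.

    Assumption: "density" means the share of token positions on which the
    feature fires at all. The source page does not confirm this definition.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical toy data: 1000 token positions, 5 of them active.
toy = [0.0] * 995 + [0.7, 1.2, 0.3, 2.1, 0.05]
density = activation_density(toy)
print(f"{density:.3%}")
```

Under this reading, a density of 0.483% would mean the feature fires on roughly 1 in every 207 token positions.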