INDEX
Explanations
words related to confessions or admissions of guilt or wrongdoing
expressions of acknowledgment or admission of wrongdoing
New Auto-Interp
Negative Logits
asus
-0.84
ILCS
-0.76
assic
-0.74
rouse
-0.74
osi
-0.72
phabet
-0.72
DragonMagazine
-0.71
rior
-0.71
ighth
-0.70
prev
-0.70
POSITIVE LOGITS
defeat
1.20
wrongdoing
1.16
guilt
1.08
mistakes
1.00
ignorance
0.93
fault
0.92
responsibility
0.90
shortcomings
0.88
admitting
0.87
failings
0.84
Activations Density 0.051%