INDEX
Explanations
phrases related to apologizing and taking responsibility
language related to accountability and apologies
Negative Logits
Enlarge         -0.82
toggle          -0.81
uggle           -0.65
ption           -0.64
stuff           -0.63
dry             -0.63
pop             -0.63
sidx            -0.62
?),             -0.62
rez             -0.62
Positive Logits
sic             1.11
regrett         1.00
Plaintiff       0.86
lawful          0.84
Statement       0.80
hereby          0.79
inappropriate   0.77
Defendant       0.76
unlawfully      0.76
responsibly     0.75
Activations Density 1.345%