INDEX
Explanations
apologies or statements of regret
expressions of apology and regret
New Auto-Interp
Negative Logits
FTWARE
-0.79
DragonMagazine
-0.78
bard
-0.75
kefeller
-0.73
fman
-0.69
atown
-0.67
vantage
-0.67
otic
-0.66
Ready
-0.64
Mehran
-0.64
POSITIVE LOGITS
mistakes
1.25
inconvenience
1.16
inconven
1.14
mist
1.05
hurting
1.03
unintentional
1.02
unintentionally
1.01
offended
1.00
ruining
1.00
sins
1.00
Activations Density 0.145%