Explanations
expressions of apology and regret
Negative Logits
Hauptartikel    -0.79
ließlich        -0.68
GHIJKLM         -0.66
jutant          -0.62
læng            -0.59
bkz             -0.59
κιν             -0.59
merking         -0.55
ivably          -0.54
ajur            -0.54
Positive Logits
apologies       1.37
apologise       1.24
apology         1.21
apologize       1.18
sorry           1.13
apologizing     1.11
forgive         1.10
Pardon          1.08
apologised      1.08
apologized      1.07
Activations Density: 0.078%