INDEX
Explanations
expressions of apology or regret
apologizing for mistakes or delays
New Auto-Interp
Negative Logits
MenuInflater
-0.44
bex
-0.42
egli
-0.42
quarie
-0.41
Leeds
-0.40
Nix
-0.39
înc
-0.39
فت
-0.39
knex
-0.38
maș
-0.38
POSITIVE LOGITS
sorry
1.77
SORRY
1.56
sorry
1.52
Sorry
1.42
Sorry
1.33
sorri
0.75
apologised
0.75
désolés
0.74
apologise
0.73
抱歉
0.73
Activations Density 0.003%