INDEX
Explanations
expressions of apologies and explanations of delays or issues
Negative Logits
Token        Logit
Harding      -0.17
rael         -0.17
mn           -0.15
ifter        -0.14
dangerously  -0.14
lock         -0.14
ë§Ŀ          -0.14
.Interop     -0.13
CTR          -0.13
arm          -0.13
Positive Logits
Token        Logit
apology      0.32
apologies    0.30
apolog       0.30
apologize    0.27
apologized   0.26
Ap           0.26
apologise    0.26
Sorry        0.23
sorry        0.22
Sorry        0.22
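
The logit lists above rank vocabulary tokens by how strongly the feature pushes their output logits up or down. The page does not say how the values were computed. A minimal sketch, assuming each value is the dot product of the feature's output direction with that token's unembedding row, might look like this. The names `logit_effects` and `top_and_bottom`, the two-dimensional vectors, and the toy numbers are all hypothetical.

```python
def logit_effects(feature_direction, unembedding):
    """Dot the feature direction with each token's unembedding row.

    Assumption: this is how per-token logit values are derived; the
    source page does not confirm the method.
    """
    return [
        sum(w * d for w, d in zip(row, feature_direction))
        for row in unembedding
    ]


def top_and_bottom(tokens, effects, k):
    """Return the k most positive and the k most negative (token, effect) pairs."""
    ranked = sorted(zip(tokens, effects), key=lambda pair: pair[1], reverse=True)
    most_positive = ranked[:k]
    most_negative = ranked[-k:][::-1]  # most negative first
    return most_positive, most_negative


# Toy vocabulary and weights, made up for illustration only.
tokens = ["apology", "sorry", "the", "Harding"]
unembedding = [[0.32, 0.1], [0.22, 0.5], [0.0, 0.3], [-0.17, 0.2]]
feature_direction = [1.0, 0.0]

effects = logit_effects(feature_direction, unembedding)
positive, negative = top_and_bottom(tokens, effects, k=2)
```

In this toy setup `positive` starts with `apology` and `negative` starts with `Harding`, mirroring the ordering convention of the two lists.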
Activations Density 0.169%
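
The density figure can be read as the share of sampled tokens on which the feature fires at all. A minimal sketch, assuming density means the fraction of tokens with activation above zero (the page does not define it), follows. The function name `activation_density` and the sample counts are hypothetical.

```python
def activation_density(activations, threshold=0.0):
    """Fraction of tokens whose activation exceeds the threshold.

    Assumption: "density" means the share of tokens with a nonzero
    activation; the source page does not define the term.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: 169 active tokens out of 100,000.
sample = [1.0] * 169 + [0.0] * (100_000 - 169)
density = activation_density(sample)
density_label = f"{density:.3%}"
```

Under this reading, a density of 0.169% means the feature is active on roughly 17 of every 10,000 tokens.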