INDEX
Explanations
expressions of apology or regret
New Auto-Interp
Negative Logits
Werde
-0.32
cupboards
-0.31
listdir
-0.31
sweise
-0.31
XtraBars
-0.31
からです
-0.31
果
-0.31
Initiatives
-0.31
prefeitura
-0.31
villaggio
-0.31
POSITIVE LOGITS
sorry
1.30
sorry
1.25
SORRY
1.22
Sorry
1.21
Sorry
1.20
apologize
1.01
apologies
0.95
apologise
0.95
apologized
0.94
apology
0.93
Activations Density 0.119%