Explanations
words related to apologies and expressions of regret
instances of apologies and expressions of remorse
Negative Logits
weeney        -0.80
arnaev        -0.76
marked        -0.72
Downloadha    -0.71
adj           -0.70
estial        -0.69
markets       -0.68
ther          -0.67
tein          -0.67
::::::::      -0.67
Positive Logits
giving         1.01
unres          0.99
apology        0.90
apologized     0.88
apologize      0.87
forgiveness    0.84
apologised     0.82
apologizing    0.81
apologise      0.78
apologies      0.78
Activations Density 0.027%