INDEX
Explanations
words related to apologies and acts of apologizing
instances of the word "apologize" and its variations
New Auto-Interp
Negative Logits
weeney
-0.88
corn
-0.80
jun
-0.76
spot
-0.76
arnaev
-0.76
marked
-0.75
VEN
-0.71
lining
-0.71
vet
-0.68
eding
-0.68
POSITIVE LOGITS
apologize
1.14
apologise
1.08
apologised
1.07
apologized
1.07
apology
1.00
apologizing
0.99
apologies
0.97
sorry
0.84
unres
0.81
apolog
0.76
Activations Density 0.012%