Explanations
statements of apology or regret
expressions of regret or disappointment
Negative Logits
pires          -0.74
Farming        -0.68
ordable        -0.65
Electricity    -0.65
Located        -0.65
è¦ļéĨĴ         -0.63
âĦ¢:           -0.62
mega           -0.61
ansion         -0.61
arthed         -0.60
Positive Logits
apologized      1.37
apologize       1.36
regret          1.29
messed          1.28
deserved        1.27
regrets         1.26
regretted       1.25
apologies       1.21
screwed         1.19
apologizing     1.16
Activations Density 0.729%
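
Two of the negative-logit tokens above, `è¦ļéĨĴ` and `âĦ¢:`, look like encoding debris but are more likely raw vocabulary strings from a byte-level BPE tokenizer, where every byte is shown as a stand-in printable character. The sketch below assumes the model uses the GPT-2-style byte-to-character table (the dashboard does not say which tokenizer it uses) and rebuilds that table to turn such strings back into text. The function names `byte_to_unicode_table` and `decode_token` are hypothetical helpers, not part of any dashboard API.

```python
def byte_to_unicode_table():
    """Rebuild the byte -> stand-in character table of GPT-2-style byte-level BPE.

    Assumption: the tokenizer behind this dashboard follows the GPT-2 scheme.
    """
    # Bytes that are already printable keep their own character.
    printable = (
        list(range(ord("!"), ord("~") + 1))
        + list(range(ord("¡"), ord("¬") + 1))
        + list(range(ord("®"), ord("ÿ") + 1))
    )
    table = {b: chr(b) for b in printable}
    # Every remaining byte is shifted to an unused code point starting at 256.
    shift = 0
    for b in range(256):
        if b not in table:
            table[b] = chr(256 + shift)
            shift += 1
    return table


# Inverse lookup: stand-in character -> original byte value.
UNICODE_TO_BYTE = {ch: b for b, ch in byte_to_unicode_table().items()}


def decode_token(token: str) -> str:
    """Map a displayed token string back to the text it encodes."""
    raw = b"".join(
        # Characters outside the table are passed through as plain UTF-8.
        bytes([UNICODE_TO_BYTE[ch]]) if ch in UNICODE_TO_BYTE else ch.encode("utf-8")
        for ch in token
    )
    # Tokens may split a multi-byte character, so tolerate invalid UTF-8.
    return raw.decode("utf-8", errors="replace")


for tok in ["è¦ļéĨĴ", "âĦ¢:", "regret"]:
    print(repr(tok), "->", repr(decode_token(tok)))
```

Under that assumption, `âĦ¢:` decodes to `™:` and `è¦ļéĨĴ` to `覚醒`, while plain ASCII tokens such as `regret` pass through unchanged. If the model uses a different tokenizer, the table would not apply and the strings should be left as displayed.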