INDEX
Explanations
apologies and expressions of regret
expressions of apology and regret
New Auto-Interp
Negative Logits
\">
-0.78
guiActiveUn
-0.76
rones
-0.75
sightings
-0.74
qi
-0.73
Farming
-0.71
mosqu
-0.71
ancies
-0.71
ndum
-0.71
population
-0.71
POSITIVE LOGITS
apologized
2.08
apology
2.07
apologize
1.95
apologizing
1.92
apologise
1.91
apologies
1.91
apologised
1.91
remorse
1.80
regrets
1.69
regret
1.67
Activations Density 0.671%