INDEX
Explanations
statements expressing gratitude or requests for assistance
expressions of desire or preference
New Auto-Interp
Negative Logits
VERTISEMENT
-0.84
ingen
-0.72
@#&
-0.70
sequence
-0.69
shock
-0.65
aan
-0.63
idious
-0.62
ubb
-0.62
aring
-0.61
Jung
-0.61
POSITIVE LOGITS
revenge
0.74
forgiveness
0.70
ransom
0.68
assurances
0.66
clarification
0.63
assurance
0.62
redress
0.62
gery
0.62
hya
0.61
lier
0.61
Activations Density 0.042%