Explanations
parenthetical actions and tones
Negative Logits
ొప్ప  0.45
thrice  0.41
시절  0.41
peculiarities  0.41
detonations  0.40
granit  0.40
मित्रों  0.39
喜爱  0.39
solubilities  0.39
ərd  0.38
Positive Logits
shrug  0.84
smirk  0.80
playfully  0.79
smiling  0.75
looking  0.75
smiled  0.71
softly  0.70
shrugged  0.70
smiles  0.69
yawn  0.68
Activations Density: 0.031%