Explanations
expressions of admiration and delight
Negative Logits
containment  0.43
contain  0.38
chứa  0.37
precarious  0.37
попыта  0.35
содержит  0.35
mars  0.34
содержа  0.34
హె  0.34
🌑  0.33
Positive Logits
மகிழ்ச்ச  0.72
begeistert  0.71
মুগ্ধ  0.71
praises  0.68
delighted  0.66
memnun  0.66
happily  0.64
만족  0.63
overjoyed  0.63
pleased  0.62
Activations Density 0.100%