INDEX
Explanations
expressing gratitude and thanks
New Auto-Interp
Negative Logits
descom
0.39
Transactional
0.39
爭
0.38
κών
0.37
tolerated
0.37
मददगार
0.37
ఎక్కువగా
0.36
отвер
0.36
ഭം
0.35
qīng
0.35
POSITIVE LOGITS
gratitude
1.83
Grat
1.38
condolences
1.33
grat
1.29
感謝
1.27
grat
1.26
sincere
1.26
благодар
1.26
apologies
1.25
thanks
1.21
Activations Density 0.028%