Explanations
admiration for work and achievements
Negative Logits
Token    Logit
確保    0.41
можем    0.40
我們可以    0.39
원래    0.38
ಯಾವುದೇ    0.38
ามารถ    0.37
ابقه    0.37
kommt    0.37
ponemos    0.37
通常    0.37
Positive Logits
Token    Logit
inspiring    0.75
fascin    0.68
admired    0.67
fascinated    0.67
admiring    0.66
admires    0.65
admire    0.64
inspirational    0.63
fascinating    0.63
admiration    0.62
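Token lists like the ones above are commonly produced by projecting a feature's output direction onto each token's unembedding vector and ranking the results. The source does not say how these particular values were computed, so the following is only a minimal sketch under that assumption. The names `logit_effects`, `top_and_bottom`, `decoder_direction`, and `unembedding`, and the 2-dimensional toy vectors, are hypothetical and are not taken from the dashboard.

```python
def logit_effects(decoder_direction, unembedding):
    """Dot the feature direction with each token's unembedding vector.

    Both arguments are hypothetical stand-ins: a list of floats and a
    dict mapping token -> list of floats of the same length.
    """
    return {
        token: sum(d * w for d, w in zip(decoder_direction, vector))
        for token, vector in unembedding.items()
    }


def top_and_bottom(effects, k):
    """Return the k most promoted and the k most suppressed tokens."""
    ranked = sorted(effects.items(), key=lambda item: item[1], reverse=True)
    most_promoted = ranked[:k]
    most_suppressed = ranked[-k:][::-1]  # most negative first
    return most_promoted, most_suppressed


# Toy data, invented for illustration only.
toy_direction = [1.0, 0.5]
toy_unembedding = {
    "inspiring": [0.7, 0.1],
    "admire": [0.6, 0.2],
    "kommt": [-0.3, -0.2],
    "通常": [-0.2, -0.3],
}

toy_effects = logit_effects(toy_direction, toy_unembedding)
promoted, suppressed = top_and_bottom(toy_effects, k=2)
```

In this toy setup `promoted` begins with "inspiring" and `suppressed` begins with "kommt". The real dashboard values depend on the model's actual weights, which are not part of this page.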
Activations Density: 0.108%
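An activation density figure is usually read as the fraction of sampled tokens on which the feature fires. Assuming that reading, which the source does not spell out, it could be computed as in the sketch below. The function name `activation_density`, the `threshold` parameter, and the sample values are hypothetical.

```python
def activation_density(activations, threshold=0.0):
    """Fraction of tokens whose activation exceeds the threshold.

    `activations` is a hypothetical list with one float per sampled token.
    """
    if not activations:
        return 0.0
    fired = sum(1 for value in activations if value > threshold)
    return fired / len(activations)


# Toy sample, invented for illustration: the feature fires on 1 of 4 tokens.
sample = [0.0, 0.0, 1.2, 0.0]
density = activation_density(sample)
print(f"{density:.3%}")
```

Under this reading, a density of 0.108% would mean the feature is active on roughly one token in every 900 or so sampled.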