INDEX
Explanations
love and related concepts across languages
New Auto-Interp
Negative Logits
ంగా
0.65
ों
0.51
उत्ते
0.51
ంలో
0.51
માહિતી
0.50
ip
0.49
०
0.49
uk
0.48
okkal
0.48
okat
0.48
POSITIVE LOGITS
love
1.14
любви
0.97
Love
0.96
Love
0.95
любовь
0.94
ljubav
0.93
LOVE
0.92
爱情
0.88
LOVE
0.88
爱的
0.87
Activations Density 0.094%