Explanations
correspondingly, respectively, both
Negative Logits
giant  0.73
मुझे  0.66
мне  0.66
huge  0.65
musk  0.65
calf  0.64
gigantes  0.63
me  0.62
saya  0.62
били  0.62
Positive Logits
likewise  1.01
correspondingly  0.98
دونوں  0.84
simultaneously  0.84
رجع  0.82
ገድ  0.82
역시  0.82
entrambe  0.81
Aynı  0.81
соответственно  0.81
Activations Density 2.570%