INDEX
Explanations
list items separated by context
New Auto-Interp
Negative Logits
якщо
0.43
कर्ता
0.43
jeśli
0.43
सेंटीमीटर
0.41
在你
0.41
Registro
0.41
immagine
0.40
encije
0.40
enzione
0.39
मुझसे
0.39
POSITIVE LOGITS
this
0.95
it
0.85
these
0.83
we
0.78
ĝi
0.76
इसने
0.70
această
0.70
этой
0.69
diese
0.66
tämä
0.66
Activations Density 0.073%