INDEX
Explanations
four divisions, Arabic, Wikipedia
New Auto-Interp
Negative Logits
ρι
0.41
dere
0.41
হইতেছে
0.39
梸
0.39
됐다
0.38
ناك
0.37
പ്രവര്ത്തന
0.37
ду
0.36
dados
0.36
óricos
0.36
POSITIVE LOGITS
ويكيپيديا
0.63
تكتب
0.47
تكون
0.46
kuwa
0.41
onu
0.41
ク
0.40
Wikipédia
0.39
Catawiki
0.39
keeping
0.39
Carrera
0.39
Activations Density 0.000%