INDEX
Explanations
phrases indicating examples or instances
Negative Logits
demás      -0.49
übrigen    -0.44
other      -0.40
demais     -0.39
otros      -0.39
altri      -0.39
outra      -0.39
lainnya    -0.38
otras      -0.36
beiden     -0.36
Positive Logits
those      1.05
الرياضيه   0.99
those      0.89
ceux       0.85
celles     0.84
كومونز     0.84
namely     0.83
ones       0.81
الحره      0.81
quelli     0.80
Activations Density: 0.703%