INDEX
Explanations
references to the English language and its usage
english or english wikipedia
New Auto-Interp
Negative Logits
للمعارف
-0.63
oa̍t
-0.62
httphttps
-0.57
استنادى
-0.57
بوابة
-0.55
-0.54
-0.54
tonode
-0.53
AndEndTag
-0.53
⤹
-0.51
POSITIVE LOGITS
英語
0.46
inglês
0.42
英語
0.42
englisch
0.41
English
0.41
heits
0.38
international
0.38
English
0.37
english
0.37
انگلیسی
0.36
Activations Density 0.024%