INDEX
Explanations
Arabic words or phrases
references to Arabic and Hebrew languages
Negative Logits
Token       Logit
ertodd      -1.02
llan        -0.87
hov         -0.77
alling      -0.75
lessly      -0.75
ideshow     -0.72
olicy       -0.71
zanne       -0.71
redo        -0.70
Wrestle     -0.70
Positive Logits
Token        Logit
transl       0.84
Corpus       0.83
flu          0.82
Hebrew       0.79
accents      0.79
language     0.75
translation  0.75
alam         0.74
language     0.74
Arabic       0.73
Activations Density: 0.006%