INDEX
    Explanations

    Arabic words or phrases

    references to Arabic and Hebrew languages

    Negative Logits
    Token          Logit
    "ertodd"       -1.02
    "llan"         -0.87
    "hov"          -0.77
    "alling"       -0.75
    "lessly"       -0.75
    "ideshow"      -0.72
    "olicy"        -0.71
    "zanne"        -0.71
    "redo"         -0.70
    " Wrestle"     -0.70
    Positive Logits
    Token            Logit
    " transl"        0.84
    " Corpus"        0.83
    " flu"           0.82
    " Hebrew"        0.79
    " accents"       0.79
    " language"      0.75
    " translation"   0.75
    "alam"           0.74
    "language"       0.74
    " Arabic"        0.73
    Activation Density: 0.006%

    No Known Activations