INDEX
    Explanations

    the presence of the word "Ar."

    New Auto-Interp
    Negative Logits
    ագրություններ
    -0.98
    SOUNDBITE
    -0.94
     myſelf
    -0.86
     Theſe
    -0.84
     pleaſure
    -0.82
     Anſ
    -0.81
    webElementXpaths
    -0.80
     Beſ
    -0.80
     ་་
    -0.80
     Diſ
    -0.80
    POSITIVE LOGITS
     Ar
    3.09
    Ar
    2.90
     ar
    2.56
     AR
    2.05
     Ар
    1.92
    ar
    1.75
    Ар
    1.71
    AR
    1.59
     ар
    1.57
     Arb
    1.35
    Act Density 0.068%

    No Known Activations