INDEX
    Explanations
    Negative Logits
    Token           Logit
     albums         -0.07
     خورد           -0.07
     Bom            -0.07
    aja             -0.07
     album          -0.07
     admiration     -0.07
    And             -0.07
     Dallas         -0.07
     البحث          -0.06
     dead           -0.06
    Positive Logits
    Token           Logit
    Sure            0.08
     губ            0.07
     JAXB           0.07
    $$$             0.06
    .AR             0.06
    .PER            0.06
     smarty         0.06
                    0.06
    ",$             0.06
     $__            0.06
    Activation Density: 0.010%

    No Known Activations