INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     роботи
    -0.07
     vet
    -0.06
     خواهد
    -0.06
     kit
    -0.06
    -MM
    -0.06
    _va
    -0.06
     chung
    -0.06
     queries
    -0.06
     zprav
    -0.06
     quarterly
    -0.06
    POSITIVE LOGITS
    bool
    0.07
    'RE
    0.07
    Screens
    0.06
     thousand
    0.06
    ?>
    0.06
     Prim
    0.06
    CGSize
    0.06
    سة
    0.06
    bellion
    0.06
    ينية
    0.06
    Act Density 0.042%

    No Known Activations