INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     ویکی‌پدیا
    -0.41
    ECONDS
    -0.38
    PhysRev
    -0.37
     chine
    -0.37
    -0.36
    wold
    -0.36
    gory
    -0.36
     sortie
    -0.35
    tsy
    -0.35
    acterium
    -0.35
    POSITIVE LOGITS
    principalColumn
    0.50
    ]")]
    0.49
     tartalomajánló
    0.47
    zzleHttp
    0.46
    SharedCtor
    0.46
    RectangleBorder
    0.45
     BoxFit
    0.43
     vPvB
    0.42
     spese
    0.42
    AntiForgeryToken
    0.42
    Act Density 0.002%

    No Known Activations