INDEX
    Explanations

    halftime/time

    New Auto-Interp
    Negative Logits
    806
    -0.07
     پرداخت
    -0.07
     dış
    -0.07
    Comparison
    -0.06
    eliminar
    -0.06
     Erdoğan
    -0.06
     Über
    -0.06
     Trash
    -0.06
    _soup
    -0.06
     exam
    -0.06
    POSITIVE LOGITS
     halftime
    0.09
     Subscribe
    0.07
    iral
    0.07
    وع
    0.06
    CALE
    0.06
    /comments
    0.06
    .images
    0.06
    _REF
    0.06
    _dataset
    0.06
    :].
    0.06
    Act Density 0.008%

    No Known Activations