INDEX
    Explanations

    phrases emphasizing relationships and connections between concepts

    Negative Logits
    " تكبرها"          -0.60
    "endif"            -0.55
    " endif"           -0.55
    "djangoproject"    -0.55
    "angu"             -0.52
    " ostavi"          -0.52
    " dimas"           -0.52
    " entr"            -0.50
    " tarto"           -0.50
    "ياه"              -0.50

    Positive Logits
    " the"              0.76
    "GOTREF"            0.76
    " them"             0.65
    "InvalidProtocol"   0.62
    "SBATCH"            0.61
    "mación"            0.60
    "edos"              0.60
    " NSCoder"          0.59
    "Còn"               0.56
    "eat"               0.56
    Activation Density: 0.457%

    No Known Activations