INDEX
    Explanations

    phrases indicating rationale or justification

    New Auto-Interp
    Negative Logits
    multer
    -0.46
    دانشنامهٔ
    -0.44
    Jîn
    -0.42
    dengan
    -0.39
    djangoproject
    -0.39
    Попис
    -0.38
    cotch
    -0.37
    ASKET
    -0.36
    󠁬
    -0.36
    󠁮
    -0.35
    POSITIVE LOGITS
     why
    0.97
    why
    0.70
     mengapa
    0.68
     warum
    0.64
     kenapa
    0.63
     Reason
    0.62
     WHY
    0.62
     behind
    0.59
     reason
    0.58
     waarom
    0.56
    Act Density 0.240%

    No Known Activations