INDEX
    Explanations

    signs of disagreement or debate in a text

    Negative Logits
    /autoload     -0.16
    deen          -0.15
    asha          -0.14
    basePath      -0.14
    323           -0.14
    antry         -0.14
    ügen          -0.14
    itä           -0.14
     Complaint    -0.14
     Harr         -0.14
    Positive Logits
    modo          0.16
    dol           0.16
    ä»ĭ           0.14
    icha          0.14
    atin          0.14
    otropic       0.14
    prech         0.14
     apl          0.14
    idia          0.14
    odel          0.13
    Act Density 0.002%

    No Known Activations