INDEX
    Explanations

    sentences containing questions or inquiries

    New Auto-Interp
    Negative Logits
    leck
    -0.17
    shire
    -0.16
    igkeit
    -0.15
    /fixtures
    -0.15
    AGON
    -0.14
    gie
    -0.14
    -ul
    -0.14
    .tell
    -0.14
    sak
    -0.14
    reu
    -0.14
    POSITIVE LOGITS
    naires
    0.26
    naire
    0.21
    stell
    0.16
    pare
    0.16
    stown
    0.15
    ccione
    0.14
    lycer
    0.14
    stellung
    0.14
    arrow
    0.14
    eger
    0.14
    Act Density 0.041%

    No Known Activations