    Explanations

    occurrences of the word "is"

    Negative Logits
    erville     -0.18
    gether      -0.16
    extras      -0.15
     Deutsch    -0.15
    quam        -0.15
    ensex       -0.15
    ics         -0.14
    koli        -0.14
    ãģĵãģĿ      -0.14
    ils         -0.14
    Positive Logits
    abelle      0.19
    otope       0.18
    ring        0.16
    /w          0.15
    rig         0.14
    ycop        0.14
    ÌĨ          0.14
    engin       0.14
    one         0.14
    erm         0.14
    Activation Density 0.154%

    No Known Activations