INDEX
    Explanations

    function words that indicate relationships or comparisons

    New Auto-Interp
    Negative Logits
    öm
    -0.15
    stell
    -0.15
    ().'/
    -0.14
    ruise
    -0.14
    kening
    -0.14
    .store
    -0.14
     Heading
    -0.14
    HeaderCode
    -0.14
    aven
    -0.14
     patter
    -0.14
    POSITIVE LOGITS
     hole
    0.15
     Nam
    0.15
    ijd
    0.14
    ritz
    0.14
     Gir
    0.14
    igne
    0.14
    toy
    0.14
     backed
    0.14
    idders
    0.14
     Hole
    0.14
    Act Density 0.000%

    No Known Activations