INDEX
    Explanations

    infinitive verbs and phrases expressing capability or potential actions

    New Auto-Interp
    Negative Logits
    ycz
    -0.18
    zeitig
    -0.17
    essor
    -0.16
     cla
    -0.16
    pron
    -0.14
     mess
    -0.14
    rema
    -0.14
    ber
    -0.14
    ewan
    -0.14
    zahl
    -0.14
    POSITIVE LOGITS
    adal
    0.18
    Ïĥη
    0.15
    urovision
    0.14
    iro
    0.14
    iox
    0.14
    <context
    0.14
    /grpc
    0.14
     nameof
    0.14
    ButtonClick
    0.14
    uckets
    0.14
    Act Density 0.008%

    No Known Activations