INDEX
    Explanations

    the pattern "X" followed by a number

    the occurrences of the token 'X' in various contexts

    Negative Logits
    getic        -0.84
    kson         -0.79
    enegger      -0.70
    inally       -0.69
    ufact        -0.68
     captcha     -0.66
    ¢            -0.66
    cffff        -0.66
    Ú            -0.65
     Poké        -0.64
    Positive Logits
    avier        1.28
    peria        1.21
    cellence     1.14
    posed        0.96
    aminer       0.96
    III          0.94
    odus         0.92
    clusive      0.92
    posure       0.92
    ML           0.92
    Activation Density 0.027%

    No Known Activations