INDEX
    Explanations

    occurrences of the letter 'f'

    New Auto-Interp
    Negative Logits
    ¢
    -3.02
    ±
    -2.83
    º
    -2.66
    °
    -2.57
    Ļª
    -2.54
    ľĵ
    -2.52
    »¿
    -2.48
    ¼
    -2.46
    ĨĴ
    -2.43
    -2.37
    POSITIVE LOGITS
    licts
    1.86
    lict
    1.81
    athom
    1.66
    aux
    1.52
    ringes
    1.52
    inally
    1.45
    ritz
    1.44
     optics
    1.42
    ilt
    1.34
    nsic
    1.30
    Act Density 0.424%

    No Known Activations