INDEX
    Explanations

    instances of the letter 'T' or the lowercase 't'

    New Auto-Interp
    Negative Logits
     ſever
    -0.83
     faſt
    -0.80
    клопе
    -0.76
     auffi
    -0.75
     Theſe
    -0.74
     myſelf
    -0.73
     viſ
    -0.72
     незавершена
    -0.71
     itſelf
    -0.71
    الإنجليزية
    -0.70
    POSITIVE LOGITS
     t
    3.07
     T
    2.72
    T
    2.33
    t
    2.29
    getT
    1.83
     т
    1.61
     ت
    1.47
    1.41
     Т
    1.32
    𝘁
    1.27
    Act Density 0.244%

    No Known Activations