INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    a
    3.77
    3.44
    ه
    3.43
    ер
    3.41
    িং
    3.28
    er
    3.27
    aik
    3.21
    e
    3.18
    erent
    3.11
    3.08
    POSITIVE LOGITS
    ting
    4.73
    ty
    3.99
    tttt
    3.90
    to
    3.80
    ৃত্বে
    3.79
    ttt
    3.65
    ীব্র
    3.61
    ta
    3.45
    መሳሳይ
    3.38
    न्त्र
    3.25
    Act Density 0.729%

    No Known Activations