INDEX
    Explanations

    Chinese Pinyin and characters

    Negative Logits
        Token           Value
        " гульнявыя"    0.58
        (not shown)     0.57
        "𝒍"             0.57
        "Раз"           0.52
        "таў"           0.51
        (not shown)     0.51
        "біць"          0.50
        (not shown)     0.50
        (not shown)     0.50
        " Стаўкі"       0.50
    Positive Logits
        Token           Value
        " ép"           0.45
        " quan"         0.43
        "úng"           0.43
        " Liao"         0.42
        (not shown)     0.42
        "ép"            0.41
        (not shown)     0.39
        "ương"          0.38
        (not shown)     0.37
        (not shown)     0.37
    Activation Density: 0.014%

    No Known Activations