INDEX
    Explanations

    multilingual script identifiers

    New Auto-Interp
    Negative Logits
     a
    1.67
    1.57
    1.53
     at
    1.45
    1.35
    1.21
    ą
    1.20
    ة
    1.16
    1.13
    К
    1.10
    POSITIVE LOGITS
    uje
    1.34
    c
    1.15
    ן
    1.14
    ז
    1.05
    p
    1.03
    িন
    1.02
    ет
    1.02
    g
    1.00
    ih
    0.97
    she
    0.96
    Act Density 0.349%

    No Known Activations