INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    د
    1.70
    1.19
    2
    1.18
    ون
    1.14
    at
    1.13
    مة
    1.13
    inę
    1.13
    žila
    1.12
    ب
    1.12
    ו
    1.11
    POSITIVE LOGITS
    ING
    1.47
    _
    1.26
    AL
    1.20
    SH
    1.16
    '
    1.16
    AS
    1.16
    TI
    1.15
    -
    1.13
    v
    1.13
    AN
    1.09
    Act Density 0.000%

    No Known Activations