INDEX
    NEGATIVE LOGITS
    Token     Logit
    "in"      1.30
    "en"      1.21
    "as"      1.20
    "er"      1.12
    "u"       1.01
    "at"      1.00
    "il"      0.98
    "am"      0.95
    "ar"      0.93
    "ang"     0.89
    POSITIVE LOGITS
    Token         Logit
    "<0x0D>"      0.83
    "ِي"          0.70
    "</h3>"       0.70
    "َاب"         0.69
    " capitán"    0.69
    "</h2>"       0.68
    "-"           0.68
    " Públic"     0.67
    "</th>"       0.65
    "ীয়"          0.65
    Activation Density: 0.004%

    No Known Activations