    Negative Logits
    Token       Logit
    "2"         1.24
    "3"         1.03
    "ی"         1.03
    "("         1.00
    "،"         0.98
    "and"       0.95
    " avere"    0.95
    " avez"     0.93
    "4"         0.93
    " amu"      0.92
    Positive Logits
    Token       Logit
    "ة"         1.20
    "с"         1.06
    "га"        0.98
    (token not captured)  0.98
    "ton"       0.94
    "<0xB5>"    0.93
    " in"       0.93
    "ين"        0.92
    "ون"        0.91
    "do"        0.90
    Activation Density: 0.000%

    No Known Activations