INDEX
    Explanations

    closing punctuation followed by new sentence

    New Auto-Interp
    Negative Logits
    ان
    0.43
    0.43
    0.39
    ن
    0.37
    }_{+}
    0.36
    ncnc
    0.36
    on
    0.36
    0.34
    as
    0.33
    ra
    0.33
    POSITIVE LOGITS
     in
    0.71
     an
    0.48
    0.45
     be
    0.43
    0.41
     I
    0.41
    0.40
     and
    0.40
    0.39
     nine
    0.39
    Act Density 0.000%

    No Known Activations