INDEX
    Explanations

    classify and class programming

    New Auto-Interp
    Negative Logits
    ك
    1.20
     are
    1.10
    1.06
     lên
    1.02
    .
    1.02
    the
    0.98
    .=
    0.96
     giá
    0.93
     znači
    0.92
    ように
    0.88
    POSITIVE LOGITS
    ar
    1.39
    ти
    1.28
     for
    1.13
    0
    1.13
    in
    1.08
    ار
    1.08
    un
    1.05
    uer
    1.03
    ig
    1.02
     I
    1.02
    Act Density 0.061%

    No Known Activations