INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    or
    0.82
    ile
    0.75
    in
    0.68
    '
    0.67
    ot
    0.65
    the
    0.63
    .
    0.63
    بی
    0.60
    0.60
     بی
    0.59
    POSITIVE LOGITS
     процесс
    1.08
     процессы
    1.06
     processes
    1.04
    过程
    1.03
     process
    1.02
     Processes
    0.98
     Proces
    0.96
     Process
    0.93
     과정을
    0.92
     Prozesse
    0.91
    Act Density 0.227%

    No Known Activations