INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    ق
    1.43
    ك
    1.28
     in
    0.96
    ра
    0.94
    است
    0.93
    0.93
    0.93
    від
    0.91
    ס
    0.91
    0.90
    POSITIVE LOGITS
    and
    1.20
     coordinates
    1.07
    1.00
    0.97
    larından
    0.95
     fica
    0.94
     좌표
    0.94
    '
    0.91
    of
    0.90
    ib
    0.87
    Act Density 0.006%

    No Known Activations