INDEX
    Explanations
    No Explanations Found
    New Auto-Interp
    Negative Logits
    мыш
    0.89
     استخدم
    0.86
     аккумулятор
    0.85
    0.83
     ምክንያ
    0.82
    cations
    0.80
    ayaran
    0.80
     монасты
    0.79
    0.79
     эффективности
    0.78
    POSITIVE LOGITS
    <start_of_image>
    0.73
     fe
    0.71
     new
    0.70
     gast
    0.70
    同時に
    0.68
    続いて
    0.68
     public
    0.68
     nation
    0.67
    ↵↵
    0.66
     the
    0.65
    Act Density 0.000%

    No Known Activations