INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     ввод
    0.71
     Executive
    0.70
     EXECUTIVE
    0.65
     executive
    0.64
    ंदी
    0.63
    Neb
    0.62
    Executive
    0.61
     Args
    0.61
     chestnut
    0.61
     Abrams
    0.59
    POSITIVE LOGITS
     bằng
    1.08
     بہ
    0.91
     ب
    0.91
     بالأ
    0.90
     بال
    0.90
     به
    0.89
     بالك
    0.87
     بها
    0.85
     بم
    0.83
     بأ
    0.82
    Act Density 0.052%

    No Known Activations