INDEX
    Explanations

    cheating and unfair advantage

    New Auto-Interp
    Negative Logits
    س
    1.09
    1.03
    1.01
    ی
    0.99
    ش
    0.92
    0.92
    اء
    0.91
    و
    0.91
    0.89
    </h2>
    0.89
    POSITIVE LOGITS
     cheating
    1.24
     cheated
    1.22
     Cheat
    1.02
    ení
    0.99
     cheats
    0.98
     cheat
    0.94
    ية
    0.84
     органов
    0.84
    រយៈ
    0.84
     роста
    0.82
    Act Density 0.008%

    No Known Activations