INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    کل
    0.81
     chimney
    0.70
     transporting
    0.69
    یاء
    0.69
     hauling
    0.69
     anionic
    0.68
     приема
    0.68
    𝘔
    0.68
     chattel
    0.68
    0.67
    POSITIVE LOGITS
    t
    1.01
    т
    0.89
    д
    0.80
    да
    0.79
    tio
    0.79
    jasmine
    0.78
    csak
    0.76
    rscheinlich
    0.73
    0.72
    signed
    0.72
    Act Density 0.005%

    No Known Activations