INDEX
    Explanations

    performance assessment criteria

    New Auto-Interp
    Negative Logits
     aprire
    0.57
    也会
    0.56
    自分で
    0.55
    也可以
    0.52
     famously
    0.52
     জন্যও
    0.52
    ouvrir
    0.51
     તમારે
    0.51
     dinero
    0.50
    你可以
    0.50
    POSITIVE LOGITS
     satisfactory
    0.81
     lacks
    0.80
     inadequate
    0.78
     deficiencies
    0.78
     insufficient
    0.76
     unsatisfactory
    0.76
     satisfactorily
    0.75
     lacked
    0.75
    缺乏
    0.74
     adequately
    0.74
    Act Density 0.102%

    No Known Activations