INDEX
    Explanations

    terms related to the effectiveness of programs or interventions

    New Auto-Interp
    Negative Logits
    omial
    -0.08
       
    -0.07
    eric
    -0.07
    utes
    -0.07
    Left
    -0.07
    igan
    -0.06
     lại
    -0.06
    ccess
    -0.06
     Left
    -0.06
    ipt
    -0.06
    POSITIVE LOGITS
    dorf
    0.07
    amu
    0.07
    ื
    0.07
    eson
    0.07
    bÃŃ
    0.07
    ines
    0.07
    rchive
    0.07
    atin
    0.07
     Fal
    0.07
    onec
    0.07
    Act Density 0.007%

    No Known Activations