INDEX
    Explanations

    teaching/education

    New Auto-Interp
    Negative Logits
    天çĦ¶
    -0.26
    isson
    -0.25
    éĹ´æİ¥
    -0.25
    è¿Ļæĸ¹éĿ¢
    -0.24
    zburg
    -0.24
     disproportion
    -0.24
    æį
    -0.24
    jen
    -0.24
    beer
    -0.24
    ä¿Ĺç§°
    -0.24
    POSITIVE LOGITS
    arming
    0.29
     educating
    0.28
     ages
    0.27
    Ãłnh
    0.25
    inator
    0.25
    åħ¨æĹ¥
    0.24
    éĺ´éĺ³
    0.24
    辨åĪ«
    0.24
    æķĻèĤ²
    0.24
     Panama
    0.24
    Act Density 4.348%

    No Known Activations