INDEX
    Explanations

    religion/worship

    Negative Logits
    剩下
    -0.27
    qua
    -0.27
    odge
    -0.26
    这点
    -0.26
    可见
    -0.26
    的优势
    -0.26
    嫁
    -0.26
    remain
    -0.26
    Exactly
    -0.25
    anka
    -0.25
    Positive Logits
     dönüş
    0.28
    alous
    0.27
     prescribing
    0.25
    发
    0.25
    здание
    0.24
    _related
    0.24
     alma
    0.24
    NAL
    0.24
    sworth
    0.23
     commuter
    0.23
    Act Density 0.001%

    No Known Activations