INDEX
    Explanations

    references to Jewish history and societal issues related to Jews

    New Auto-Interp
    Negative Logits
     Leer
    -0.16
    tg
    -0.15
    emap
    -0.14
    tb
    -0.14
    uco
    -0.14
    dorf
    -0.13
    nid
    -0.13
    ometr
    -0.13
     cao
    -0.13
    odyn
    -0.13
    POSITIVE LOGITS
    EIF
    0.15
    EXPR
    0.14
    owy
    0.14
    vÄĽÅĻ
    0.14
     Everyday
    0.14
    ?:
    0.13
    ENTA
    0.13
    à¸ļà¸Ńà¸ģ
    0.13
    LayoutParams
    0.13
    (?:
    0.12
    Act Density 0.195%

    No Known Activations