INDEX
    Explanations

    references to Western influence and interactions with the Muslim world

    New Auto-Interp
    Negative Logits
    ensem
    -0.07
    rud
    -0.07
     Yup
    -0.07
    rei
    -0.06
    udit
    -0.06
     Democr
    -0.06
    lli
    -0.06
    sei
    -0.06
    abus
    -0.06
    eldig
    -0.06
    POSITIVE LOGITS
     Partition
    0.07
    Partition
    0.07
    )frame
    0.07
    raig
    0.07
    iola
    0.07
    partition
    0.07
    xED
    0.07
    vÄĽÅĻ
    0.07
    iena
    0.06
    ãģĸ
    0.06
    Act Density 0.001%

    No Known Activations