INDEX
    Explanations

    references to historical events and their implications on individuals and communities

    New Auto-Interp
    Negative Logits
    lobby
    -0.14
    aver
    -0.14
     cáºŃn
    -0.14
    ãģijãģªãģĦ
    -0.14
    aname
    -0.13
     homic
    -0.13
     bilder
    -0.13
     delayed
    -0.13
    irts
    -0.13
    isors
    -0.13
    POSITIVE LOGITS
    uin
    0.16
    ovÃŃd
    0.15
    .seed
    0.15
    ÎŃα
    0.15
     ngu
    0.15
     Guill
    0.15
    ysz
    0.15
    PUR
    0.14
    SSF
    0.14
     Persona
    0.14
    Act Density 0.059%

    No Known Activations