INDEX
    Explanations

    references to political figures and their actions or affiliations

    New Auto-Interp
    Negative Logits
    ÑĨеп
    -0.09
    .updateDynamic
    -0.09
    ãĥ§
    -0.08
    liž
    -0.08
    ɵ
    -0.08
     ört
    -0.07
    ForRow
    -0.07
    subjects
    -0.07
    endum
    -0.07
    /**č↵
    -0.07
    POSITIVE LOGITS
     former
    0.28
     Former
    0.24
    former
    0.23
    Former
    0.22
     býval
    0.16
     retired
    0.15
     erst
    0.14
     formerly
    0.13
     ex
    0.12
     سابÙĤ
    0.11
    Act Density 0.095%

    No Known Activations