    Explanations

    terms related to politics and government

    Negative Logits
    Token                Logit
    " gist"              -0.65
    " promoters"         -0.64
    "anium"              -0.62
    " imitation"         -0.61
    "odium"              -0.57
    " sophistication"    -0.57
    "raviolet"           -0.57
    " range"             -0.57
    "eanor"              -0.57
    " loopholes"         -0.56
    Positive Logits
    Token                Logit
    "Ļ"                  1.29
    "女"                 1.18
    "Ľ"                  1.02
    "ļ"                  1.00
    "ķ"                  0.99
    "ħ"                  0.99
    "çIJ"                0.94
    "ı"                  0.94
    "º"                  0.92
    "İ"                  0.92
    Activation Density: 0.537%

    No Known Activations