    Explanations

    terms related to politics and socio-political contexts

    Negative Logits
    ected        -0.16
    lite         -0.15
    eus          -0.15
    y            -0.15
    ei           -0.15
    ein          -0.14
    etik         -0.14
    Vector       -0.14
    eil          -0.14
    ury          -0.14
    Positive Logits
    heid         0.26
    heits        0.20
    es           0.19
    ere          0.19
    erer         0.19
    este         0.17
    heit         0.17
    сть          0.17
    weg          0.17
    CellValue    0.16
    Activation Density: 0.066%

    No Known Activations