INDEX
    Explanations

    themes related to democracy, human rights, and the interdependence of society and the environment

    Negative Logits
    Token          Logit
    "jets"         -0.15
    "alim"         -0.15
    "esub"         -0.14
    "irit"         -0.14
    "æķ¦"          -0.14
    " entail"      -0.13
    "orney"        -0.13
    "cla"          -0.13
    "esp"          -0.13
    " differed"    -0.13
    Positive Logits
    Token          Logit
    " depends"     0.58
    " depend"      0.58
    "depends"      0.47
    " Depends"     0.45
    " depended"    0.45
    "depend"       0.43
    " dependent"   0.42
    " Depend"      0.41
    " dependence"  0.40
    " relies"      0.37
    Activation Density: 0.092%

    No Known Activations