INDEX
    Explanations

    references to danger or conflict related to power dynamics

    Negative Logits
     informée         -0.54
     defaultstate     -0.47
    webElementXpaths  -0.45
    Tracce            -0.43
     comprised        -0.43
     EconPapers       -0.43
     heretofore       -0.41
    Дереккөздер       -0.41
     utilised         -0.39
    Aiheesta          -0.38
    Positive Logits
     Coordin          0.57
     Dinas            0.56
     strongest        0.55
     safest           0.52
     diet             0.52
     herbal           0.52
     Noticias         0.51
     herbs            0.51
     best             0.50
     does             0.49
    Activation Density: 0.032%

    No Known Activations