INDEX
    Explanations

    political terms related to power dynamics and manipulation

    terms related to social hierarchy and dependency dynamics

    New Auto-Interp
    Negative Logits
    ively
    -0.99
    sburg
    -0.89
    ispers
    -0.86
    rawdownloadcloneembedreportprint
    -0.83
    iveness
    -0.82
    osed
    -0.76
    artney
    -0.76
    maxwell
    -0.74
    adeon
    -0.74
    ains
    -0.73
    POSITIVE LOGITS
    hyde
    0.83
    doms
    0.75
     faults
    0.74
    yrinth
    0.73
    ller
    0.71
    ware
    0.70
     lodge
    0.70
    ck
    0.69
    ced
    0.68
    lda
    0.68
    Act Density 0.082%

    No Known Activations