INDEX
    Explanations

    individuals and their affiliations or roles

    instances of the word "who" in relation to individuals and their descriptions or roles

    New Auto-Interp
    Negative Logits
     economical
    -0.67
     misogyn
    -0.67
    Georg
    -0.64
     arbitrary
    -0.64
    urg
    -0.63
    destruct
    -0.63
    ³³³³
    -0.62
     logical
    -0.62
    Failure
    -0.61
    Nut
    -0.59
    POSITIVE LOGITS
     oversaw
    1.20
     oversees
    1.16
     participated
    1.07
     attended
    1.06
     owns
    1.05
     attends
    1.02
     specializes
    1.02
     specialize
    1.01
     authored
    0.96
     chaired
    0.95
    Act Density 0.100%

    No Known Activations