INDEX
    Explanations

    references to leadership roles and titles in various contexts

    NEGATIVE LOGITS
    Token            Logit
    "wap"            -0.14
    "Interceptor"    -0.14
    "udas"           -0.14
    "osti"           -0.13
    " colleagues"    -0.13
    "alian"          -0.13
    "iate"           -0.13
    "thood"          -0.13
    "antages"        -0.13
    " Economist"     -0.13
    POSITIVE LOGITS
    Token            Logit
    " driving"       0.37
    " brains"        0.33
    "brains"         0.28
    " force"         0.28
    "-driving"       0.28
    " Driving"       0.27
    " co"            0.27
    " creator"       0.25
    " inst"          0.24
    "Driving"        0.23
    Activation Density: 0.135%

    No Known Activations