INDEX
    Explanations

    terms related to leadership

    New Auto-Interp
    Negative Logits
    fy
    -0.19
    ty
    -0.15
    use
    -0.15
     swallow
    -0.14
    hed
    -0.14
     ending
    -0.14
    dale
    -0.14
    ÙĦÙĥ
    -0.14
    achine
    -0.14
    umble
    -0.14
    POSITIVE LOGITS
    gers
    0.19
    iven
    0.18
    quarters
    0.17
    -edge
    0.16
    ONGL
    0.16
    hra
    0.15
    ivities
    0.15
    quartered
    0.14
    uria
    0.14
    _DISABLED
    0.14
    Act Density 0.055%

    No Known Activations