INDEX
    Explanations

    references to leadership qualities and professional expertise

    New Auto-Interp
    Negative Logits
    âĪı
    -0.19
     labour
    -0.17
    cie
    -0.17
     connexion
    -0.17
    ulia
    -0.17
     Fortune
    -0.16
    avourite
    -0.16
     beaut
    -0.15
     honour
    -0.15
     honoured
    -0.15
    POSITIVE LOGITS
     à¹Ĩ
    0.17
    Armor
    0.15
    pj
    0.15
     embedding
    0.15
     dementia
    0.14
    906
    0.14
     Armor
    0.14
    .Magenta
    0.14
    frag
    0.14
     Capability
    0.14
    Act Density 0.152%

    No Known Activations