INDEX
    Explanations

    personality descriptions

    This neuron responds to descriptive adjectives and phrases that signal confidence, assertiveness, and leadership qualities.

    New Auto-Interp
    Negative Logits
     си
    -0.07
     Fram
    -0.06
    -0.06
     Как
    -0.06
     перв
    -0.06
     Ngoài
    -0.06
    _MAP
    -0.06
    (extra
    -0.06
    _fg
    -0.06
    -0.06
    POSITIVE LOGITS
     compiled
    0.06
    ruh
    0.06
    aravel
    0.06
     zdrav
    0.06
    ослав
    0.06
     бюджет
    0.06
     gladly
    0.06
    INS
    0.06
    金额
    0.06
     маш
    0.06
    Act Density 0.091%

    No Known Activations