INDEX
    Negative Logits
    " boxes"        -0.08
    "mind"          -0.08
    " cases"        -0.07
    " Developer"    -0.07
    " need"         -0.07
    "/portfolio"    -0.07
    " terrorism"    -0.07
    " body"         -0.07
    " сохра"        -0.07
    "fabric"        -0.06
    Positive Logits
    " greeting"        0.09
                       0.07
    " greet"           0.06
    ".defaultProps"    0.06
    " greeted"         0.06
    "Greetings"        0.06
    "reeting"          0.06
    ".keep"            0.06
    " greetings"       0.06
    "gate"             0.06
    Activation Density: 0.008%

    No Known Activations