INDEX
    Explanations

    phrases that emphasize the importance of certain concepts or values

    comparisons emphasizing the importance of various concepts or elements

    New Auto-Interp
    Negative Logits
     guiActiveUn
    -0.79
    RANT
    -0.76
    flat
    -0.74
    schild
    -0.69
    UGE
    -0.68
     contracted
    -0.66
    balls
    -0.65
    Studio
    -0.65
    owler
    -0.65
    washer
    -0.64
    POSITIVE LOGITS
    protect
    0.85
     preserving
    0.82
     safegu
    0.75
     ensuring
    0.75
     educating
    0.75
     deterrence
    0.74
     protecting
    0.73
     decipher
    0.73
     apprehend
    0.71
     importance
    0.70
    Act Density 0.254%

    No Known Activations