INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    -"+
    -0.07
     Scratch
    -0.06
    Discover
    -0.06
    -0.06
     Browse
    -0.06
     implementations
    -0.06
     -->
    -0.06
     Birth
    -0.06
    isContained
    -0.06
    طبيق
    -0.06
    POSITIVE LOGITS
     attitude
    0.10
     stance
    0.09
     outlook
    0.07
     हर
    0.06
    LOOK
    0.06
     Grape
    0.06
     перс
    0.06
     posture
    0.06
     HUD
    0.06
     mentality
    0.06
    Act Density 0.007%

    No Known Activations