INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     Bailey
    -0.10
     Ind
    -0.10
     jogging
    -0.09
    2
    -0.09
     fir
    -0.09
     Marble
    -0.09
     Mann
    -0.09
     Minecraft
    -0.09
     Deferred
    -0.09
    orphism
    -0.09
    POSITIVE LOGITS
    BODY
    0.11
     body
    0.10
    Resistance
    0.10
     ply
    0.10
    |#
    0.10
     ROM
    0.10
     BODY
    0.10
     Body
    0.10
     resistance
    0.09
    Vals
    0.09
    Act Density 0.006%

    No Known Activations