INDEX
    Explanations

    references to nudity and body confidence

    Negative Logits
    ovit               -0.18
    ayet               -0.16
    utters             -0.15
    429                -0.15
    929                -0.15
    otch               -0.14
    orrh               -0.14
    prog               -0.14
    isel               -0.14
    .elasticsearch     -0.13
    Positive Logits
     naked              0.33
     bare               0.31
    è£                  0.31
     stripping          0.29
     natur              0.29
     nude               0.28
     exposed            0.28
     stark              0.28
     NU                 0.28
     semi               0.27
    Act Density 0.044%

    No Known Activations