INDEX
    Explanations

    phrases related to poses and representations of child abuse

    New Auto-Interp
    Negative Logits
    хьтан
    -0.98
    CONSIN
    -0.97
    Ivo
    -0.96
    contentLoaded
    -0.93
     Verdun
    -0.90
    GEBURTSDATUM
    -0.86
    SBATCH
    -0.85
    hyrchwyd
    -0.85
     GenerationType
    -0.83
    -------------</
    -0.83
    POSITIVE LOGITS
     pose
    1.55
     Pose
    1.50
     poses
    1.48
     posed
    1.47
    Pose
    1.47
     posing
    1.40
    pose
    1.29
     posé
    0.92
     poser
    0.88
    POSE
    0.86
    Act Density 0.007%

    No Known Activations