INDEX
Explanations
phrases related to poses and representations of child abuse
New Auto-Interp
Negative Logits
хьтан
-0.98
CONSIN
-0.97
Ivo
-0.96
contentLoaded
-0.93
Verdun
-0.90
GEBURTSDATUM
-0.86
SBATCH
-0.85
hyrchwyd
-0.85
GenerationType
-0.83
-------------</
-0.83
POSITIVE LOGITS
pose
1.55
Pose
1.50
poses
1.48
posed
1.47
Pose
1.47
posing
1.40
pose
1.29
posé
0.92
poser
0.88
POSE
0.86
Activations Density 0.007%