Explanations
domination and submission role-play
Negative Logits
Token                      Logit
蛋白質 ("protein")          0.45
其次 ("secondly")           0.42
損傷 ("damage")             0.40
transferases               0.40
nonzero                    0.40
actores ("actors")         0.40
chevaux ("horses")         0.40
(token not shown)          0.39
Presumably                 0.39
proteins                   0.38
Positive Logits
Token                      Logit
Dom                        0.71
Dom                        0.57
domination                 0.56
सेवक ("servant")           0.56
naughty                    0.55
dominant                   0.54
obedient                   0.54
bitch                      0.54
slut                       0.53
humiliated                 0.52
Activations Density 0.072%
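
As a rough illustration of how a density figure like this can be read, the sketch below assumes that activation density means the fraction of token positions on which the feature fires (activation above zero). That definition, the function name activation_density, and the sample data are all assumptions made for illustration; they are not taken from this page.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds `threshold`.

    Assumption: "density" is the share of positions where the feature is active.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical data: 10,000 token positions, 7 of which activate the feature.
acts = [0.0] * 10_000
for i in (3, 512, 1024, 2048, 4096, 7000, 9999):
    acts[i] = 1.5

density = activation_density(acts)
print(f"{density:.3%}")  # formatted as a percentage, like the figure above
```

Under this reading, a density of 0.072% would correspond to the feature firing on roughly 7 of every 10,000 token positions.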