INDEX
Explanations
terms related to illegal or taboo relationships, specifically incest
references to incest
New Auto-Interp
Negative Logits
deen
-0.77
eem
-0.74
zl
-0.72
oÄŁ
-0.71
kinson
-0.69
ngth
-0.69
kar
-0.68
vich
-0.67
ez
-0.67
eting
-0.66
POSITIVE LOGITS
incest
1.17
cest
0.87
uous
0.82
suff
0.73
opt
0.72
uring
0.71
coerc
0.71
ortium
0.71
itus
0.70
Consent
0.70
Activations Density 0.027%