INDEX
Explanations
instances of physical intimacy and sexual relationships
New Auto-Interp
Negative Logits
ellas
-0.17
nett
-0.15
agma
-0.15
пÑĢоÑĢ
-0.15
gin
-0.15
едж
-0.15
Magnus
-0.14
anker
-0.14
PMC
-0.14
ogh
-0.14
POSITIVE LOGITS
recip
0.16
inheritDoc
0.14
alic
0.14
å¿Ĺ
0.14
bunk
0.14
periment
0.14
grat
0.14
ürk
0.14
cassert
0.13
ASTE
0.13
Activations Density 0.101%