Explanations
phrases and descriptions related to infidelity or illicit romantic relationships
references to romantic or extramarital relationships
Negative Logits
solid               -0.87
srf                 -0.74
ACTED               -0.71
printed             -0.71
ovi                 -0.71
externalActionCode  -0.70
owicz               -0.70
imm                 -0.69
gments              -0.68
umbing              -0.68
Positive Logits
affair      0.93
ional       0.79
ttes        0.77
involving   0.73
uality      0.73
revolving   0.72
contrace    0.71
ilial       0.68
cov         0.66
ually       0.66
Activations Density: 0.022%