INDEX
Explanations
mentions of romantic relationships and their dynamics
romantic relationships
mentions of a person's romantic partner or spouse.
New Auto-Interp
Negative Logits
complexType
-0.58
AssemblyCulture
-0.56
ddelweddau
-0.55
RectangleBorder
-0.55
ContentAlignment
-0.48
.")]
-0.48
TestingModule
-0.48
שוליים
-0.47
ostar
-0.47
Paroles
-0.47
POSITIVE LOGITS
+#+
0.46
boyfriend
0.44
Dates
0.43
fidanz
0.41
boyfriends
0.39
lovers
0.39
ArrowToggle
0.38
Dates
0.38
girlfriend
0.36
Dating
0.36
Activations Density 0.029%