INDEX
Explanations
references to romance and romantic story elements in various narratives
New Auto-Interp
Negative Logits
lug
-0.19
ÄĻd
-0.16
bler
-0.16
asje
-0.15
оби
-0.15
hol
-0.14
ATOR
-0.14
èĻ«
-0.14
ollower
-0.14
apyrus
-0.14
POSITIVE LOGITS
romance
0.19
billionaire
0.18
heat
0.18
Romance
0.18
heat
0.18
Heat
0.17
billionaires
0.17
hotter
0.16
alpha
0.16
Heat
0.16
Activations Density 0.059%