Explanations
terms related to enticing or seducing actions
words related to temptation and seduction
Negative Logits
Token         Logit
blance        -0.81
amily         -0.80
hops          -0.77
Found         -0.73
cedented      -0.71
bard          -0.71
elt           -0.70
Nationwide    -0.69
LAN           -0.68
oln           -0.68
Positive Logits
Token         Logit
lure          1.11
enticing      0.98
tempting      0.95
lured         0.95
tempt         0.94
temptation    0.89
irresistible  0.83
tempted       0.80
blackmail     0.79
coer          0.79
Activations Density 0.078%
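
As a reading aid, the density figure can be converted into more tangible counts. The sketch below assumes that "Activations Density" means the fraction of tokens on which this feature has a nonzero activation; that definition is not stated on this page, and the function names are hypothetical.

```python
def density_to_fraction(density_percent: float) -> float:
    """Convert a density given in percent (e.g. 0.078) to a plain fraction."""
    return density_percent / 100.0


def expected_active_tokens(density_percent: float, n_tokens: int) -> float:
    """Expected number of tokens on which the feature fires, out of n_tokens.

    Assumes density is the share of tokens with nonzero activation.
    """
    return density_to_fraction(density_percent) * n_tokens


def tokens_per_activation(density_percent: float) -> float:
    """Average number of tokens seen per single activation (1 / fraction)."""
    return 1.0 / density_to_fraction(density_percent)


if __name__ == "__main__":
    density = 0.078  # percent, as reported above
    print(round(expected_active_tokens(density, 100_000)))
    print(round(tokens_per_activation(density)))
```

Under that assumption, 0.078% corresponds to roughly 78 active tokens per 100,000, or about one activation every 1,282 tokens, which would be consistent with a sparse feature tied to a narrow vocabulary such as "lure" and "tempt".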