Explanations
words related to temptation and desire
tempting, luring, enticing
Negative Logits
PhysRev      -0.48
endregion    -0.43
normas       -0.42
bäst         -0.41
paikan       -0.41
mierda       -0.40
tää          -0.40
Manns        -0.39
endregion    -0.39
Citiți       -0.39
Positive Logits
temptation    1.01
tempting      1.00
temptations   0.88
tempted       0.88
tempt         0.86
Temptation    0.82
enticing      0.81
Tempt         0.81
entice        0.79
lure          0.77
Activations Density: 0.007%