INDEX
Explanations
phrases related to placing oneself in vulnerable or dangerous situations
New Auto-Interp
Negative Logits
querque
-0.70
imar
-0.69
llo
-0.69
glers
-0.69
heim
-0.67
orf
-0.65
lli
-0.65
catentry
-0.65
uum
-0.65
nos
-0.63
POSITIVE LOGITS
behalf
0.60
imitation
0.59
righteousness
0.59
nomine
0.59
theirs
0.57
Shutterstock
0.57
deeds
0.55
stake
0.55
whichever
0.54
jeopardy
0.54
Activations Density 0.331%