Explanations
phrases related to falling into a certain category or situation
phrases that indicate the concept of falling into traps or categories
Negative Logits
Token       Logit
be          -0.69
hunt        -0.68
indu        -0.67
endors      -0.66
toe         -0.65
enery       -0.65
toured      -0.62
iere        -0.62
conn        -0.61
press       -0.61
Positive Logits
Token       Logit
obscurity   0.75
whichever   0.74
Disorder    0.71
Seg         0.70
submission  0.69
Role        0.68
limbo       0.67
trap        0.66
position    0.66
olester     0.66
Activations Density: 0.075%