INDEX
Explanations
phrases indicating completeness or detail in descriptions
phrases that describe something being completed or accompanied by specific attributes or elements
Negative Logits
thren      -0.72
ungle      -0.71
sts        -0.70
borgh      -0.70
orah       -0.69
nery       -0.69
ights      -0.67
lisher     -0.67
ivas       -0.67
testing    -0.66
Positive Logits
stood      1.03
regard     0.81
draw       0.79
bells      0.79
nails      0.78
extras     0.75
drawn      0.73
impunity   0.72
jewels     0.71
ttes       0.71
Activations Density: 0.149%