INDEX
Explanations
specific phrases indicating being tied or connected
instances of the word "for" in various contexts
New Auto-Interp
Negative Logits
DOWN
-0.71
soever
-0.67
alty
-0.66
properties
-0.66
lees
-0.65
mare
-0.65
leigh
-0.63
THANK
-0.61
edin
-0.60
arios
-0.60
POSITIVE LOGITS
bidden
1.06
geries
0.94
gery
0.90
gotten
0.89
hire
0.72
ays
0.72
example
0.71
instance
0.69
starters
0.69
ked
0.68
Activations Density 0.116%