INDEX
Explanations
the word "to"
instances of the phrase "seems to."
NEGATIVE LOGITS
ãĤ¦ãĤ¹      -0.76
bats        -0.73
opens       -0.68
rieved      -0.66
cart        -0.65
estern      -0.65
GGGGGGGG    -0.65
blooded     -0.61
leanor      -0.61
ashes       -0.60
POSITIVE LOGITS
embody      1.00
imply       0.98
equate      0.95
indicate    0.95
contradict  0.94
be          0.93
defy        0.91
have        0.88
originate   0.88
specialize  0.88
Activations Density: 0.072%