Explanations
comparative, ending in "er"
The neuron activates on comparative adjectives (words in the "-er" form).
Negative Logits
Token       Logit
藝          -0.07
will        -0.07
obtained    -0.07
v           -0.07
bout        -0.07
onView      -0.07
would       -0.06
ับ          -0.06
ill         -0.06
ould        -0.06
Positive Logits
Token       Logit
stronger    0.12
wider       0.12
Higher      0.12
larger      0.12
faster      0.12
er          0.12
ER          0.12
harder      0.12
higher      0.11
Faster      0.11
Activations Density: 0.101%
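
A density figure like this can be read as the share of sampled tokens on which the neuron fires. The sketch below is one way to compute such a number. It assumes "density" means the percentage of tokens whose activation is above zero, which the page does not state. The function name `activation_density`, the `threshold` parameter, and the sample data are all hypothetical.

```python
def activation_density(activations, threshold=0.0):
    """Return the percentage of tokens whose activation exceeds `threshold`.

    Assumption: a token counts as "active" when its activation is strictly
    greater than the threshold (zero by default).
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return 100.0 * active / len(activations)


# Hypothetical sample: 1 active token out of 1000.
sample = [0.0] * 999 + [2.5]
print(f"{activation_density(sample):.3f}%")  # prints "0.100%"
```

Under this reading, a density of 0.101% would mean the neuron fires on roughly one token in a thousand.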