INDEX
Explanations
The neuron consistently activates on occurrences of the word “ideal.”
New Auto-Interp
Negative Logits
bonds
-0.07
seats
-0.06
पद
-0.06
feet
-0.06
urg
-0.06
itives
-0.06
issions
-0.06
actories
-0.06
nw
-0.06
ukkan
-0.06
POSITIVE LOGITS
ANTI
0.07
loggedin
0.07
Consider
0.06
ContextHolder
0.06
_pose
0.06
_RESP
0.06
_TD
0.06
"?
0.06
‘s
0.06
(LayoutInflater
0.06
Activations Density 0.059%