INDEX
Explanations
The neuron specifically detects the phrase “cater to.”
New Auto-Interp
Negative Logits
емых
-0.07
(Method
-0.07
valide
-0.07
STATIC
-0.07
puts
-0.06
обязатель
-0.06
Hou
-0.06
INS
-0.06
نیروی
-0.06
bumps
-0.06
POSITIVE LOGITS
cater
0.14
catering
0.11
Cater
0.11
)reader
0.07
Attribution
0.07
serving
0.06
water
0.06
met
0.06
attribution
0.06
atever
0.06
Activations Density 0.002%