INDEX
Explanations
products
The neuron detects promotional language emphasizing the provision of high-quality products and services.
New Auto-Interp
Negative Logits
해야
-0.07
schemes
-0.07
RL
-0.06
opathy
-0.06
côt
-0.06
�
-0.06
Errors
-0.06
[]"
-0.06
.raise
-0.05
cli
-0.05
POSITIVE LOGITS
liqu
0.07
pps
0.07
Francie
0.07
BAB
0.07
intrigue
0.07
(component
0.07
ViewData
0.07
recruited
0.06
aybe
0.06
.device
0.06
Activations Density 0.037%