INDEX
Explanations
Text related to recommendation systems, specifically collaborative filtering and how user preferences are assessed.
This neuron fires on phrases describing a product–customer interaction, specifically on tokens in the standard “product being viewed or purchased by a customer” construction.
Negative Logits
father        -0.07
질            -0.07
indi          -0.06
vides         -0.06
lerdir        -0.06
泊            -0.06
一覧          -0.06
ture          -0.06
kê            -0.06
Scientists    -0.06

Positive Logits
updates       0.07
updated       0.06
rebuild       0.06
惊            0.06
_COLL         0.06
insights      0.06
(updated      0.06
PF            0.06
720           0.06
30            0.06

Activations Density: 0.061%