Explanations
This neuron detects references to the word “year,” especially in temporal phrases like “last year.”
Negative Logits
날          -0.07
く          -0.07
addon      -0.06
Websites   -0.06
ห          -0.06
нин        -0.06
элемент    -0.06
()<        -0.06
solely     -0.06
Romantic   -0.06
Positive Logits
SGD        0.07
®          0.07
_bm        0.06
příspěv    0.06
egree      0.06
.flag      0.06
UILDER     0.06
ейств      0.06
igel       0.06
peng       0.06
Activations Density: 0.018%