INDEX
Explanations
Time (am/hours/morning)
This neuron detects temporal phrases referring to early hours of the day (e.g., “early…morning,” “wee hours,” “1 am,” etc.).
New Auto-Interp
Negative Logits
Choir
-0.06
ifle
-0.06
Rena
-0.06
bla
-0.06
ワ
-0.06
,c
-0.06
FEC
-0.06
ymb
-0.06
choir
-0.06
_FRIEND
-0.06
POSITIVE LOGITS
midnight
0.07
advisor
0.07
fulfillment
0.07
) ↵ ↵ ↵
0.07
suggest
0.06
supplying
0.06
……
0.06
-is
0.06
’.↵↵
0.06
-request
0.06
Activations Density 0.008%