INDEX
Explanations
past events
This neuron detects expressions referring to a previous occasion, especially the phrase “the last time.”
Negative Logits
afc
-0.07
oft
-0.07
rain
-0.06
.shapes
-0.06
aft
-0.06
applicant
-0.06
fdf
-0.06
Nh
-0.06
make
-0.06
Pink
-0.06
Positive Logits
уча
0.07
').'</
0.06
?????
0.06
()))↵↵
0.06
"'"
0.06
bart
0.06
ála
0.06
#↵↵
0.06
pit
0.06
("^
0.06
Activations Density 0.028%