INDEX
Explanations
The neuron responds to words that signal importance or necessity (e.g. “important,” “crucial”).
New Auto-Interp
Negative Logits
Datetime
-0.08
líž
-0.08
alendar
-0.07
Sleep
-0.07
.get
-0.07
ieten
-0.07
ngày
-0.07
falling
-0.07
title
-0.07
malar
-0.07
POSITIVE LOGITS
crucial
0.13
Cruc
0.09
�
0.07
quan
0.07
@js
0.07
Crimes
0.07
??
0.06
essential
0.06
vital
0.06
necess
0.06
Activations Density 0.019%