Explanations
motivation and goals
This neuron activates on words related to perseverance and disciplined effort, such as persistence, resistance, sacrifice, and staying focused.
Negative Logits
füg          -0.07
IPP          -0.07
'-           -0.07
ImageData    -0.07
книж         -0.07
忘           -0.07
kvinn        -0.07
memorial     -0.06
ayı          -0.06
ETA          -0.06
Positive Logits
-square      0.07
086          0.07
ức           0.06
directional  0.06
oggled       0.06
proficiency  0.06
ร            0.06
analyzing    0.05
Figure       0.05
�            0.05
Activations Density 0.031%