INDEX
Explanations
classical
The neuron primarily activates on the adjective “classical,” marking instances where something is described as classical (e.g., classical dynamics, classical field, classical analysis).
New Auto-Interp
Negative Logits
Makeup
-0.07
S
-0.07
нос
-0.07
nurs
-0.07
validar
-0.07
h
-0.07
methods
-0.06
��
-0.06
train
-0.06
Fac
-0.06
POSITIVE LOGITS
ạt
0.08
thiếu
0.07
Arb
0.07
Quân
0.06
_esc
0.06
-Sah
0.06
irrespective
0.06
semiclass
0.06
dem
0.06
chipset
0.06
Activations Density 0.005%