Explanations
relationships between variables
This neuron responds to occurrences of the term “event” (especially in the phrase “other event”) in the instructions defining causal narratives.
Negative Logits
launch        -0.07
compl         -0.06
extensive     -0.06
ลำ            -0.06
Hide          -0.06
�             -0.06
recur         -0.06
simulated     -0.06
옥            -0.06
ман           -0.06
Positive Logits
Sections      0.07
uthor         0.07
年の          0.06
Specialist    0.06
Tips          0.06
ทาง           0.06
naturally     0.06
Scalars       0.06
.dimension    0.06
DisplayName   0.06
Activations Density 0.001%