Explanations
The neuron activates on the word “INCLUDING”, typically inside the parenthesized “(INCLUDING …)” clause of license-style disclaimers.
Negative Logits
�    -0.06
Containers    -0.06
��    -0.06
ayacak    -0.05
Motors    -0.05
casual    -0.05
-color    -0.05
therefore    -0.05
enfrent    -0.05
416    -0.05
Positive Logits
}↵↵    0.07
//{↵    0.07
HT    0.07
/'↵    0.07
一度    0.07
{{↵    0.06
граду    0.06
илання    0.06
تحصیل    0.06
Хар    0.06
Activations Density 0.000%