INDEX
Explanations
legal documents
This neuron is recognizing tokens based on their position in the document, activating most strongly on words in the middle sections of these legal texts.
New Auto-Interp
Negative Logits
care
-0.07
мыш
-0.07
.basic
-0.06
cigars
-0.06
Slovenia
-0.06
угод
-0.06
Tek
-0.06
.ON
-0.06
函数
-0.06
wheels
-0.06
POSITIVE LOGITS
Conan
0.06
.preview
0.06
碎
0.06
TASK
0.06
[root
0.06
jLabel
0.06
payouts
0.06
ráno
0.06
HAL
0.06
Crushing
0.06
Activations Density 0.004%