Explanations
group comparisons
This neuron activates on the labels and identifiers of study groups (e.g., “Group I,” “II,” “A,” “B”).
Negative Logits
لام
-0.07
ysql
-0.06
train
-0.06
ollah
-0.06
rain
-0.06
El
-0.06
kesinlikle
-0.06
Boeh
-0.06
einem
-0.06
eta
-0.06
Positive Logits
rusty
0.07
retract
0.06
bergen
0.06
خاطر
0.06
_PUT
0.06
وية
0.06
//}↵↵
0.06
типу
0.06
<{↵
0.06
:view
0.06
Activations Density 0.040%