INDEX
Explanations
applicable
The neuron activates on occurrences of the word “applicable.”
New Auto-Interp
Negative Logits
BAS
-0.07
رفت
-0.07
$rows
-0.07
безопасности
-0.06
Bio
-0.06
DO
-0.06
_DO
-0.06
ад
-0.06
Json
-0.06
.ba
-0.06
POSITIVE LOGITS
applicable
0.14
licable
0.08
mystical
0.07
acle
0.07
prevalent
0.07
actical
0.07
Available
0.07
etsk
0.07
atical
0.07
ेच
0.07
Activations Density 0.005%