Explanations
This neuron activates on code tokens involved in loading pretrained models, in particular calls to the ".from_pretrained" method.
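To illustrate the kind of code the explanation refers to, here is a minimal sketch that flags lines containing a ".from_pretrained" call. It is a plain regex proxy for the described pattern, not the neuron itself. The helper name find_from_pretrained and the SAMPLE snippet are hypothetical, introduced only for this example, and the sample is scanned as text and never executed.

```python
import re

# Assumption: a regex stand-in for the pattern described above.
# It matches the literal method name and is not derived from the model.
FROM_PRETRAINED = re.compile(r"\.from_pretrained\b")


def find_from_pretrained(source: str) -> list[tuple[int, str]]:
    """Return (line_number, stripped_line) for each line with a .from_pretrained call."""
    hits = []
    for lineno, line in enumerate(source.splitlines(), start=1):
        if FROM_PRETRAINED.search(line):
            hits.append((lineno, line.strip()))
    return hits


# Hypothetical sample of model-loading code, held as a string only.
SAMPLE = '''\
from transformers import AutoModel, AutoTokenizer
tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
model = AutoModel.from_pretrained("bert-base-uncased")
outputs = model(**tokenizer("hello", return_tensors="pt"))
'''

for lineno, line in find_from_pretrained(SAMPLE):
    print(lineno, line)
```

In this sample the two loading lines are flagged, while the import line and the forward pass are not.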
Negative Logits
Token        Logit
ikk          -0.07
Festival     -0.07
_print       -0.07
هند          -0.07
.att         -0.07
�            -0.06
extraction   -0.06
Strateg      -0.06
visitors     -0.06
bryster      -0.06
Positive Logits
Token          Logit
_than          0.06
Cancelable     0.06
Animals        0.06
jsonResponse   0.06
Common         0.06
(entry         0.06
} ↵ ↵ ↵ ↵      0.06
TARGET         0.06
(blank)        0.06
conj           0.06
Activations Density: 0.002%