Explanations
The neuron fires primarily on common short function words (articles and prepositions such as "a", "the", "in", "of", "with") in these technical/patent-style documents.
Negative Logits
mu          -0.07
Fiat        -0.06
Astr        -0.06
ीएम         -0.06
def         -0.06
herit       -0.06
ücret       -0.06
_symbols    -0.06
erten       -0.06
tadır       -0.06
Positive Logits
negativity   0.06
inflater     0.06
Britann      0.06
lac          0.06
QMainWindow  0.06
Bắc          0.06
Unlimited    0.06
DOI          0.06
beforeEach   0.06
vrch         0.06
Activations Density 0.073%
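
The activation density figure can be read as the fraction of sampled tokens on which the neuron is active. The sketch below illustrates that reading only. It assumes "active" means an activation strictly greater than zero; the function name, the threshold, and the sample data are hypothetical and are not taken from this page.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of tokens whose activation exceeds `threshold`.

    Assumption: a token counts as "active" when its activation is
    strictly greater than the threshold (0.0 by default).
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: 10,000 tokens, 7 of which activate the neuron.
sample = [0.0] * 9993 + [1.2, 0.4, 2.1, 0.9, 0.3, 1.7, 0.8]
density = activation_density(sample)
print(f"Activation density: {density:.3%}")
```

On this reading, a density of 0.073% would correspond to roughly 7 active tokens per 10,000 sampled.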