Explanations
This neuron fires strongly on occurrences of the mathematical term “polytope,” including the sub-tokens it is split into.
Negative Logits
、これ         -0.07
.Column      -0.07
closer       -0.06
.AsyncTask   -0.06
Ferrari      -0.06
negligible   -0.06
scn          -0.06
War          -0.06
ीकरण         -0.06
girls        -0.06
Positive Logits
atables      0.07
submitting   0.07
.Lib         0.06
-seeking     0.06
Brewing      0.06
owego        0.06
salle        0.06
-mod         0.06
IDA          0.06
eth          0.06
Activations Density 0.004%
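
To give a sense of scale for the density figure, here is a minimal sketch. It assumes that density means the fraction of token positions on which the neuron's activation is above zero; the page does not state the definition, so this is an assumption. The function name `activation_density` and the sample data are hypothetical and for illustration only.

```python
def activation_density(activations, threshold=0.0):
    """Fraction of token positions whose activation exceeds `threshold`.

    Assumption: "density" counts a token as active when its activation
    is strictly greater than the threshold (zero by default).
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical data: 4 active tokens out of 100,000 positions.
sample = [0.0] * 99_996 + [1.2, 0.8, 2.5, 0.3]
density = activation_density(sample)
print(f"{density:.3%}")  # prints 0.004%
```

Under this reading, a density of 0.004% corresponds to roughly 4 active tokens per 100,000, which is in line with a neuron that responds to one rare term.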