INDEX
Explanations
This neuron does not respond to any tokens.
New Auto-Interp
Negative Logits
pills
-0.07
gardening
-0.07
Checkbox
-0.07
Bağ
-0.06
Bars
-0.06
Blacks
-0.06
Bytes
-0.06
่ย
-0.06
bay
-0.06
kinds
-0.06
POSITIVE LOGITS
of
0.08
of
0.07
さんの
0.07
상의
0.07
دهای
0.06
OF
0.06
های
0.06
olleyError
0.06
lookahead
0.06
ách
0.06
Activations Density 0.044%