INDEX
Explanations
This neuron detects occurrences of the technical term “chip” in the text.
New Auto-Interp
Negative Logits
(down
-0.07
управ
-0.07
restoration
-0.07
endDate
-0.07
attended
-0.07
trị
-0.06
RU
-0.06
_lengths
-0.06
communicated
-0.06
forced
-0.06
POSITIVE LOGITS
chip
0.16
Chip
0.13
chips
0.12
Chips
0.10
chip
0.10
CHIP
0.09
Chip
0.09
CHIP
0.08
(chip
0.08
Chase
0.08
Activations Density 0.006%