INDEX
Explanations
The neuron activates on the phrase “I think that,” i.e. expressions of personal opinion.
New Auto-Interp
Negative Logits
Yen
-0.07
,No
-0.07
��
-0.06
Encore
-0.06
Meer
-0.06
useMemo
-0.06
underwear
-0.06
avigate
-0.06
creek
-0.06
.ov
-0.06
POSITIVE LOGITS
that
0.10
that
0.08
That
0.08
THAT
0.07
That
0.07
-that
0.06
Pirate
0.06
Those
0.06
кап
0.06
cho
0.06
Activations Density 0.073%