Explanations
The neuron fires on occurrences of the word “credit” (especially in “credit card” and related financial contexts).
Negative Logits
JA        -0.09
Hanson    -0.08
jPanel    -0.07
Newman    -0.07
森        -0.07
JPanel    -0.07
_sess     -0.07
iomanip   -0.07
ops       -0.07
_none     -0.07
Positive Logits
credit    0.15
Credit    0.13
credits   0.11
Credit    0.11
credit    0.10
credits   0.09
Credits   0.09
Credits   0.08
credited  0.08
credible  0.07
Activations Density 0.013%
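
For a sense of scale, the density figure can be read as the share of tokens on which the neuron is active. The sketch below assumes that reading (a token counts as active when its activation exceeds a threshold of zero); the page does not state the exact definition, and the function name, threshold, and sample data are hypothetical.

```python
def activation_density(activations, threshold=0.0):
    """Fraction of tokens whose activation exceeds `threshold`.

    Assumption: "density" means the share of active tokens; the
    dashboard's actual definition is not given on the page.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: 13 active tokens out of 100,000.
sample = [0.0] * 99_987 + [1.5] * 13
density = activation_density(sample)
print(f"{density:.3%}")  # prints 0.013%
```

Under this reading, a density of 0.013% corresponds to roughly 13 active tokens per 100,000, which fits a neuron tied to a single word family.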