INDEX
Explanations
This neuron activates on occurrences of the word “employment” (and its morphological fragments like “employ”/“ment”).
New Auto-Interp
Negative Logits
trick
-0.08
292
-0.08
23
-0.08
22
-0.07
2
-0.07
decide
-0.07
33
-0.07
á
-0.07
decides
-0.07
3
-0.07
POSITIVE LOGITS
employment
0.12
Employment
0.10
Employ
0.09
Employer
0.09
Employ
0.09
employer
0.09
Emmanuel
0.08
SPELL
0.08
employ
0.08
employment
0.08
Activations Density 0.024%