Explanations
Intensity, Speed
The neuron is looking for superlative or extreme comparative adjectives (e.g. “largest,” “smallest,” “farthest,” “closest,” “heaviest,” “lightest”) that denote quantitative extremes.
Negative Logits
Token       Logit
300         -0.08
36          -0.07
ить         -0.07
basics      -0.06
283         -0.06
WITH        -0.06
MIL         -0.06
Brigham     -0.06
PK          -0.06
150         -0.06
Positive Logits
Token           Logit
おり            0.07
lsruhe          0.07
.MiddleRight    0.06
sentimental     0.06
cleared         0.06
lis             0.06
_once           0.06
@api            0.06
】,             0.06
ิวเตอร          0.06
Activations Density: 0.099%