Explanations
This neuron responds to quantifier phrases that emphasize large amounts (e.g. “lots,” “lots of,” “detail”).
Negative Logits
mask    -0.06
bag     -0.06
Brad    -0.06
еса     -0.05
bp      -0.05
Owl     -0.05
गय      -0.05
Fuel    -0.05
Ow      -0.05
hit     -0.05
Positive Logits
UGHT         0.08
บรร          0.08
ایسه         0.08
ahrenheit    0.07
!(           0.07
attitudes    0.07
rió          0.07
obbies       0.07
meses        0.07
생활         0.07
Activations Density 0.001%