INDEX
Explanations
The neuron fires on numeric literals (numbers) in the document.
New Auto-Interp
Negative Logits
lení
-0.07
Quality
-0.07
DEN
-0.07
negatives
-0.06
liquids
-0.06
-image
-0.06
Resolve
-0.06
Online
-0.06
bland
-0.06
prejudice
-0.06
POSITIVE LOGITS
]):
0.06
Might
0.06
libs
0.06
']",
0.06
мік
0.06
ATS
0.06
))]
0.06
'])?
0.06
تان
0.06
Trait
0.06
Activations Density 0.009%