Explanations
calculating remainders
This neuron fires on the numeric tokens that represent the computed remainders in “What is the remainder …” style questions.
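As a rough illustration of the kind of text this explanation describes, the sketch below builds a remainder question and computes the numeric answer, which is the sort of token the neuron is reported to fire on. The exact question wording and the helper name `remainder_example` are assumptions made for illustration; the source only gives the truncated template "What is the remainder …".

```python
def remainder_example(dividend: int, divisor: int) -> tuple[str, str]:
    """Build a hypothetical remainder question and its numeric answer.

    The question wording is an assumed phrasing, not taken from the
    neuron's actual top-activating examples.
    """
    if divisor == 0:
        raise ValueError("divisor must be non-zero")
    question = f"What is the remainder when {dividend} is divided by {divisor}?"
    # The answer string stands in for the numeric token the neuron
    # is described as firing on.
    answer = str(dividend % divisor)
    return question, answer


question, answer = remainder_example(17, 5)
print(question)
print(answer)
```

Under the explanation above, the activation would be expected on the answer token rather than on the numbers inside the question itself.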
Negative Logits
intimidating    -0.06
servers    -0.06
Hin    -0.06
-driver    -0.06
있던    -0.06
dag    -0.06
rss    -0.06
دارای    -0.06
jwt    -0.06
':    -0.06
Positive Logits
HTML    0.06
Train    0.06
автомоб    0.06
Expense    0.06
Nickel    0.06
冬    0.06
Cha    0.06
richt    0.06
Voice    0.06
↵↵↵↵↵↵↵↵↵↵↵    0.06
Activations Density: 0.005%