INDEX
Explanations
Date and time formatting
This neuron does not detect any pattern—it never activates for any token.
New Auto-Interp
Negative Logits
ुरस
-0.07
dungeon
-0.06
isors
-0.06
dairy
-0.06
екотор
-0.06
XMLElement
-0.06
JOB
-0.06
Hu
-0.06
pta
-0.06
Leer
-0.06
POSITIVE LOGITS
़े
0.06
dày
0.06
sayı
0.06
-cal
0.06
размер
0.06
чемпіон
0.06
bigger
0.06
relying
0.06
ário
0.06
persuade
0.06
Activations Density 0.016%