INDEX
Explanations
engaged in conflict or struggle
function words that serve as grammatical glue—especially prepositions and auxiliary verb contractions linking actions or clauses
argumentative dialogue
The neuron activates strongly on words that describe taking away or withholding possessions (e.g. “confiscation,” “footwear,” “defending,” “preoccupied”).
New Auto-Interp
Negative Logits
jml
0.43
OpenAI
0.40
projection
0.39
rb
0.39
indi
0.38
{:.0.38
im
0.37
ractive
0.37
INSTANCE
0.37
câncer
0.37
POSITIVE LOGITS
valiant
0.79
heinous
0.78
glorious
0.76
nefarious
0.75
vanqu
0.73
shenanigans
0.72
perilous
0.72
trusty
0.71
mighty
0.70
weaponry
0.69
Activations Density 0.059%