INDEX
Explanations
instances of uncertainty or questioning statements
The neuron detects tokens that occur at the start of sentences or answer lines — i.e., sentence-initial/discourse-starter tokens.
New Auto-Interp
Negative Logits
endphp
-1.07
enumi
-0.92
كومونز
-0.80
насељу
-0.80
'\\;'
-0.78
AnimationsModule
-0.77
resourceCulture
-0.76
pédie
-0.74
]")]
-0.74
posedge
-0.72
POSITIVE LOGITS
Absolutely
0.49
Exactly
0.48
Definitely
0.45
You
0.45
păr
0.43
jūsų
0.41
Exactly
0.41
felizes
0.41
Set
0.40
Absolutely
0.40
Activations Density 0.267%