INDEX
Explanations
Specific phrases or indicators of causation and conditions surrounding events.
The neuron detects the start-of-sequence / document position (it fires strongly on the <bos> token and other sequence-beginning positions).
Negative Logits
بوابة    -0.60
small    -0.54
(value only, token not shown)    -0.51
small    -0.49
stable    -0.47
#+#    -0.46
रेटिंग    -0.46
Small    -0.45
inki    -0.45
bảng    -0.45
Positive Logits
+#+#    0.90
AndEndTag    0.85
DockStyle    0.76
volna    0.73
'\\;'    0.73
setVerticalGroup    0.71
WithIOException    0.68
démarche    0.67
webElementXpaths    0.66
rungsseite    0.65
Activations Density: 0.655%
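The page does not define the density figure. A common reading, assumed here, is the fraction of sampled token positions on which the neuron's activation is above zero. The sketch below illustrates that reading only; the function name `activation_density`, the `threshold` parameter, and the sample values are hypothetical and are not taken from the dashboard.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of positions whose activation exceeds `threshold`.

    Assumption: "density" means the share of token positions that fire at all,
    not the mean activation value.
    """
    if not activations:
        raise ValueError("activations must not be empty")
    firing = sum(1 for a in activations if a > threshold)
    return firing / len(activations)


# Hypothetical sample: 1000 token positions, 7 of which have a nonzero activation.
sample = [0.0] * 993 + [1.2, 0.4, 3.1, 0.9, 2.2, 0.1, 5.0]

density = activation_density(sample)
print(f"{density:.3%}")
```

Under this reading, a density of 0.655% would mean the neuron fires on roughly 6 or 7 of every 1000 sampled token positions.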