INDEX
    Explanations

    Attends from a token later in the sequence that specifies a particular location or reference to an earlier token that marks a starting point within a broader context.

    Head Attr Weights
    0:0.12
    1:0.12
    2:0.11
    3:0.07
    4:0.04
    5:0.02
    6:0.05
    7:0.44
    Negative Logits
    Portale
    -0.37
    DockStyle
    -0.35
     BoxFit
    -0.34
    脚注の使い方
    -0.34
    anything
    -0.34
    StoryboardSegue
    -0.33
    UnusedPrivate
    -0.32
    
    -0.31
     anything
    -0.30
    شهاد
    -0.30
    Positive Logits
    ]")]
    0.29
    paisaje
    0.28
    ppus
    0.27
     vacanze
    0.26
     скачать
    0.25
     teaming
    0.25
     <>",
    0.25
    ujesz
    0.24
     الحره
    0.24
    幸いです
    0.24
    Act Density 0.365%

    No Known Activations