INDEX
    Explanations

    attends to research-related tokens from development-related tokens

    Head Attr Weights
    0:0.15
    1:0.24
    2:0.16
    3:0.07
    4:0.10
    5:0.05
    6:0.08
    7:0.13
    Negative Logits
    "XmlAccessType":-0.33
    "ValueStyle":-0.31
    " صوتيه":-0.30
    " dodatk":-0.30
    " للمعارف":-0.30
    "Scalars":-0.30
    " beginnetje":-0.29
    "tagHelperRunner":-0.27
    " autorytatywna":-0.27
    " stället":-0.27
    Positive Logits
    " stay":0.25
    "JTable":0.25
    "stay":0.25
    "DRS":0.25
    "expandindo":0.24
    " skinned":0.24
    "Portale":0.23
    " IAM":0.23
    " Eft":0.23
    " COT":0.23
    Act Density 0.376%

    No Known Activations