INDEX
    Explanations

    attends to policy-related tokens from business-related tokens

    New Auto-Interp
    Head Attr Weights
    0:0.08
    1:0.09
    2:0.09
    3:0.13
    4:0.11
    5:0.05
    6:0.24
    7:0.17
    Negative Logits
    SBATCH
    -0.36
     médicale
    -0.35
     froide
    -0.35
     äta
    -0.34
    utnik
    -0.33
     skydd
    -0.32
     féminine
    -0.32
    ownic
    -0.32
     chaude
    -0.31
     vectorielle
    -0.31
    POSITIVE LOGITS
    RuleContext
    0.37
     noqa
    0.32
    IntoConstraints
    0.32
    "}>
    0.32
    marshaller
    0.32
    0.29
    }`}>
    0.29
    uke
    0.28
     referenties
    0.28
    ging
    0.28
    Act Density 0.055%

    No Known Activations