INDEX
    Explanations

    attends to actions or effects related to aggressive movements or strikes from non-dominant tokens

    New Auto-Interp
    Head Attr Weights
    0:0.08
    1:0.09
    2:0.09
    3:0.16
    4:0.13
    5:0.08
    6:0.18
    7:0.15
    Negative Logits
    AnchorTagHelper
    -0.35
     betweenstory
    -0.28
    onomie
    -0.27
     становника
    -0.26
    ViewFeatures
    -0.26
    minipage
    -0.25
    PutMapping
    -0.25
    NOPQRST
    -0.25
     Lazar
    -0.25
    ภูมิ
    -0.25
    POSITIVE LOGITS
     AssemblyTitle
    0.33
     JpaRepository
    0.28
    seteq
    0.28
    Chham
    0.28
    อะไร
    0.27
    ciclop
    0.27
     שוליים
    0.26
    homonymie
    0.26
     forskj
    0.26
    thene
    0.26
    Act Density 0.116%

    No Known Activations