INDEX
    Explanations

    attends to the relations expressed through specific prepositions or phrases linking concepts and categories from tokens that appear later in the sequence

    New Auto-Interp
    Head Attr Weights
    0:0.10
    1:0.11
    2:0.43
    3:0.05
    4:0.04
    5:0.04
    6:0.04
    7:0.13
    Negative Logits
    ✨:
    -0.46
    AttributeSet
    -0.41
     insuffisamment
    -0.40
     Roskov
    -0.40
    InstanceState
    -0.39
    ########.
    -0.37
    .*")]
    -0.36
     فريبيس
    -0.35
     doInBackground
    -0.35
     Waray
    -0.33
    POSITIVE LOGITS
     medesimo
    0.34
    AddTagHelper
    0.32
    SequentialGroup
    0.30
     grunn
    0.30
    awtextra
    0.29
    Cubit
    0.29
    plate
    0.26
     opdracht
    0.25
     incentive
    0.25
    WriteTagHelper
    0.25
    Act Density 1.093%

    No Known Activations