INDEX
    Explanations

    attends to argumentative terms from negating or contrasting tokens

    Head Attr Weights
    0:0.08
    1:0.11
    2:0.10
    3:0.10
    4:0.06
    5:0.02
    6:0.22
    7:0.27
    Negative Logits
     EconPapers        -0.40
     <=",              -0.38
     незавершена       -0.34
    IVEREF             -0.34
    MLLoader           -0.32
    TagMode            -0.32
    TypedDataSet       -0.32
    بوابة              -0.32
    AutoScaleMode      -0.31
     Paglinawan        -0.31
    Positive Logits
    xffffff            0.26
     архивлан          0.23
    rably              0.23
     Engineered        0.23
     proč              0.23
    RestTemplate       0.23
    erialized          0.22
    FormTagHelper      0.22
    itized             0.22
    --){               0.22
    Act Density 0.478%

    No Known Activations