INDEX
    Explanations

    attends to opinion-related tokens from development-related tokens

    New Auto-Interp
    Head Attr Weights
    0:0.11
    1:0.14
    2:0.12
    3:0.12
    4:0.13
    5:0.05
    6:0.12
    7:0.17
    Negative Logits
    TestingModule
    -0.38
     дописавши
    -0.28
     féd
    -0.27
    theid
    -0.24
     eloku
    -0.24
    Външни
    -0.23
    IUrlHelper
    -0.23
    protoimpl
    -0.23
    ولة
    -0.23
    øv
    -0.23
    POSITIVE LOGITS
    󠁿
    0.28
    !';
    0.28
     تضيفلها
    0.26
    cillors
    0.26
    );*/
    0.26
     مشين
    0.26
    })`
    0.26
    CloseOperation
    0.25
    ();*/
    0.25
    !».
    0.25
    Act Density 0.168%

    No Known Activations