INDEX
    Explanations

    attends to actions and attempts related to various verbs from subsequent tokens describing the action or the outcome

    New Auto-Interp
    Head Attr Weights
    0:0.16
    1:0.21
    2:0.20
    3:0.05
    4:0.03
    5:0.02
    6:0.04
    7:0.25
    Negative Logits
     Réponses
    -0.28
    GEBURTSDATUM
    -0.28
    INSEE
    -0.27
    ároz
    -0.22
     BoxFit
    -0.22
    Примечания
    -0.21
    Földrajzportál
    -0.21
    ++];
    -0.20
    MigrationBuilder
    -0.20
     Marks
    -0.20
    POSITIVE LOGITS
    awtextra
    0.32
     rospy
    0.29
     smtplib
    0.28
    lemented
    0.28
    Revenir
    0.27
    HasAnnotation
    0.27
    MLLoader
    0.27
    cabulary
    0.26
    ряда
    0.26
     ειδ
    0.26
    Act Density 0.272%

    No Known Activations