INDEX
    Explanations

    expressions related to uncertainty or inconclusiveness

    Head Attr Weights
    0:0.02
    1:0.03
    2:0.02
    3:0.05
    4:0.02
    5:0.05
    6:0.34
    7:0.03
    8:0.02
    9:0.03
    10:0.10
    11:0.23
    Negative Logits
    ––
    -4.31
    -4.30
    \\
    -4.02
    ­
    -4.01
    \\\\
    -3.81
    ||
    -3.65
    -3.60
     ­
    -3.50
    —-
    -3.48
    -3.47
    Positive Logits
     Templ
    2.88
    ulkan
    2.88
     Anim
    2.87
     Kyoto
    2.81
    mares
    2.65
     McH
    2.61
     Yad
    2.60
     mum
    2.59
     Metatron
    2.58
     MLB
    2.58
    Act Density: 0.359%

    No Known Activations