    Explanations

    Attends from tokens referring to gendered pronouns or entities to tokens related to peace and to subjects associated with religious or philosophical values.

    Head Attr Weights
    0:0.11
    1:0.13
    2:0.14
    3:0.07
    4:0.06
    5:0.06
    6:0.06
    7:0.33
    Negative Logits
    StoryboardSegue     -0.48
    كويكب               -0.40
    AsUp                -0.40
    MigrationBuilder    -0.38
    WithIOException     -0.36
     "..\..\..\         -0.36
     actionMode         -0.35
     Meksiku            -0.34
    aarrggbb            -0.34
     VizieR             -0.33
    Positive Logits
    <_>                 0.29
    </h6>               0.25
    </blockquote>       0.24
    éras                0.24
    Loh                 0.23
    getRole             0.23
    Shiv                0.22
    olta                0.22
                        0.21
    関する              0.21
    Act Density 0.957%

    No Known Activations