INDEX
    Explanations

    phrases indicating prevention or avoidance of negative outcomes

    New Auto-Interp
    Head Attr Weights
    0:0.02
    1:0.01
    2:0.07
    3:0.08
    4:0.28
    5:0.02
    6:0.03
    7:0.25
    8:0.03
    9:0.03
    10:0.05
    11:0.07
    Negative Logits
    AppData
    -1.80
    cloth
    -1.73
    onyms
    -1.57
    database
    -1.56
    ebook
    -1.53
     Yaz
    -1.44
     Kardash
    -1.43
     Palest
    -1.43
     Qur
    -1.41
     Quran
    -1.40
    POSITIVE LOGITS
     hostilities
    1.95
     deterioration
    1.88
     overload
    1.87
     inevitable
    1.87
     failure
    1.85
     impending
    1.83
     revolt
    1.82
     erosion
    1.78
     disruption
    1.77
     backlash
    1.71
    Act Density 0.000%

    No Known Activations