INDEX
    Explanations

    phrases related to outcomes or consequences of actions and policies

    New Auto-Interp
    Negative Logits
    ENSITY
    -0.15
    ATEGORIES
    -0.15
    locker
    -0.14
    ldr
    -0.14
    yle
    -0.14
    gren
    -0.14
    iw
    -0.13
    aepernick
    -0.13
    -elements
    -0.13
     VARIABLES
    -0.13
    POSITIVE LOGITS
     increased
    0.25
     further
    0.21
     greater
    0.21
     decreased
    0.20
     eventual
    0.19
    vely
    0.18
     a
    0.17
     creation
    0.17
     an
    0.17
     corresponding
    0.16
    Act Density 0.144%

    No Known Activations