INDEX
    Explanations

    phrases related to agreements or some form of official directives

    terms related to agreements and calls to action

    New Auto-Interp
    Negative Logits
    cale
    -0.77
    ecause
    -0.74
    vae
    -0.69
    ndum
    -0.67
    MORE
    -0.66
    agine
    -0.65
    poon
    -0.64
    ourke
    -0.62
    Interested
    -0.62
    stasy
    -0.62
    POSITIVE LOGITS
     itself
    0.82
    iest
    0.78
    ariat
    0.75
     consists
    0.71
     revolves
    0.70
     consisted
    0.70
     factor
    0.67
    ultimate
    0.66
     interval
    0.66
     seemed
    0.66
    Act Density 0.437%

    No Known Activations