INDEX
    Explanations

    phrases related to cause and effect

    specific recurring phrases or words that suggest rules and consequences in a narrative

    Negative Logits
     Historically     -0.76
    abee              -0.75
    ibaba             -0.75
    estone            -0.74
    inspired          -0.72
    allel             -0.71
    Columb            -0.70
    derived           -0.69
    cum               -0.68
    umen              -0.68
    Positive Logits
     consequences     1.22
     slightest        1.22
     rest             1.15
     odds             1.11
     guy              1.10
     repercussions    1.05
     fuck             1.05
     situation        1.04
     truth            1.04
     outcome          1.02
    Act Density 0.385%

    No Known Activations