INDEX
    Explanations

    information related to specific scenarios or events

    instances of the word "which" used to introduce clauses

    New Auto-Interp
    Negative Logits
    dj
    -0.72
    soType
    -0.71
    redit
    -0.70
    marg
    -0.68
     defic
    -0.67
    Roll
    -0.67
    bang
    -0.65
    sc
    -0.65
    sci
    -0.64
    rene
    -0.64
    POSITIVE LOGITS
    soever
    1.07
    upon
    0.86
     case
    0.82
    kson
    0.73
     cases
    0.71
     guts
    0.70
     contestants
    0.68
     personalities
    0.67
    andom
    0.67
     they
    0.64
    Act Density 0.036%

    No Known Activations