INDEX
    Explanations

    quantitative values or measurements

    patterns of evaluation and contrast in narratives

    New Auto-Interp
    Negative Logits
    actionDate
    -0.66
     Babel
    -0.62
     Advice
    -0.61
    ãĤ©
    -0.60
     Photographer
    -0.60
     Canary
    -0.60
    ãĥĺ
    -0.59
     miscon
    -0.58
     Workers
    -0.57
     Lag
    -0.57
    POSITIVE LOGITS
     nevertheless
    0.93
     nonetheless
    0.83
    poons
    0.76
     thrive
    0.75
    este
    0.74
    lean
    0.72
    ISTER
    0.72
    eston
    0.68
     perfectly
    0.65
     thriving
    0.64
    Act Density 0.816%

    No Known Activations