INDEX
    Explanations

    conclusive terms indicating summaries or final thoughts in a text

    Negative Logits
    Token              Logit
    " also"            -0.92
    " never"           -0.73
    " now"             -0.71
    " sometimes"       -0.68
    " still"           -0.67
    " always"          -0.61
    " finally"         -0.55
    " probably"        -0.54
    " ever"            -0.53
    " see"             -0.52

    Positive Logits
    Token              Logit
    " Moreover"         1.49
    " However"          1.47
    " Furthermore"      1.46
    "Furthermore"       1.45
    "Moreover"          1.44
    " Nevertheless"     1.43
    " Therefore"        1.42
    " Additionally"     1.41
    "Additionally"      1.41
    " Accordingly"      1.41
    Act Density 0.265%

    No Known Activations