INDEX
    Explanations

    technical markers within the text, possibly indicative of formatting or metadata

    numeric values and their associated contexts

    New Auto-Interp
    Negative Logits
     favor
    -0.76
     favorably
    -0.74
     purs
    -0.74
    hovah
    -0.71
    ©¶æ
    -0.71
    ĪĴ
    -0.68
     honors
    -0.68
     behaviors
    -0.68
     solic
    -0.67
     favored
    -0.67
    POSITIVE LOGITS
    However
    1.25
    Meanwhile
    1.13
    Topics
    1.11
    Speaking
    1.09
    Writing
    1.08
    But
    1.07
    Instead
    1.06
    Read
    1.05
    Having
    1.04
    Asked
    1.04
    Act Density 0.490%

    No Known Activations