INDEX
    Explanations

    numerical values or quantities mentioned in a document

    comparisons and quantities expressed with "as" or similar phrasing

    Negative Logits
    " secondly"        -0.61
    " obligation"      -0.61
    " additionally"    -0.59
    " whatsoever"      -0.56
    " unlaw"           -0.56
    "IVES"             -0.55
    "sworth"           -0.54
    "smith"            -0.54
    "ibl"              -0.53
    " Bros"            -0.53

    Positive Logits
    " low"              1.16
    " high"             1.05
    "low"               1.03
    "phy"               1.01
    "ym"                0.99
    "little"            0.98
    " little"           0.96
    "Low"               0.90
    "ynchron"           0.89
    " much"             0.88
    Activation Density: 0.100%

    No Known Activations