INDEX
    Explanations

    negations and actions related to decision-making

    negative contractions and their usage in sentences

    New Auto-Interp
    Negative Logits
    20439
    -0.75
    Solution
    -0.74
    forcement
    -0.73
     nonetheless
    -0.73
    arb
    -0.72
    POL
    -0.70
    etimes
    -0.67
    onomy
    -0.67
    ãĤ¼
    -0.67
    evidence
    -0.66
    POSITIVE LOGITS
     typo
    0.93
     boring
    0.92
     shy
    0.91
     spoilers
    0.90
     fancy
    0.89
     duplication
    0.88
     bulky
    0.87
     gimm
    0.85
     pricey
    0.83
     disappoint
    0.81
    Act Density 0.575%

    No Known Activations