INDEX
    Explanations

    comparative adjectives

    the word "even" in various contexts emphasizing comparison or degree

    Negative Logits
    "units"         -0.78
    "mson"          -0.77
    "ATURES"        -0.76
    "artments"      -0.76
    "utics"         -0.74
    "apons"         -0.74
    "unia"          -0.72
    " Libraries"    -0.71
    "hops"          -0.70
    " SPORTS"       -0.68
    Positive Logits
    " spoiler"          0.80
    " underdog"         0.75
    " explanation"      0.74
    " approximation"    0.73
    " hitter"           0.72
    " temper"           0.70
    " sequel"           0.69
    " excerpt"          0.69
    " variant"          0.69
    " uphill"           0.69
    Act Density 0.114%

    No Known Activations