INDEX
    Explanations

    adjectives describing positive qualities or states

    instances of the word "good."

    Negative Logits
    Token           Logit
    " Span"         -0.66
    " span"         -0.63
    " offshore"     -0.61
    " mount"        -0.59
    "ument"         -0.58
    " accus"        -0.58
    " blaze"        -0.58
    " spans"        -0.58
    " elevation"    -0.57
    " tens"         -0.57
    Positive Logits
    Token           Logit
    "good"          3.92
    "Good"          2.14
    " GOOD"         1.85
    " Good"         1.68
    "bad"           1.64
    "better"        1.59
    " good"         1.58
    "nice"          1.40
    "great"         1.27
    "Excellent"     1.25
    Activation Density: 0.006%

    No Known Activations