INDEX
    Explanations

    superlatives and positive adjectives related to various objects and experiences

    instances of the word "The" likely indicating a focus on prominent features or highlights

    New Auto-Interp
    Negative Logits
    thood
    -0.83
     voluntarily
    -0.79
     thereby
    -0.79
    Ó
    -0.77
    ailable
    -0.76
    ashington
    -0.74
    rolet
    -0.73
    stretched
    -0.73
    leground
    -0.72
     according
    -0.72
    POSITIVE LOGITS
     biggest
    1.23
    oret
    1.16
     downside
    1.16
     hardest
    1.14
     coolest
    1.12
    resa
    1.05
     easiest
    1.05
     thing
    1.04
     slightest
    1.03
     irony
    1.00
    Act Density 0.405%

    No Known Activations