INDEX
    Explanations

    phrases referring to specific objects or concepts

    instances of the word "this."

    New Auto-Interp
    Negative Logits
    osponsors
    -0.89
    emale
    -0.81
    ographies
    -0.81
    acers
    -0.79
    å§«
    -0.78
    ARDS
    -0.77
    aneers
    -0.76
    tops
    -0.76
    rights
    -0.75
    tones
    -0.75
    POSITIVE LOGITS
     nifty
    1.08
     adorable
    1.06
     delightful
    1.05
     lovely
    1.04
     amazing
    0.99
     gorgeous
    0.98
     hilarious
    0.97
     incredible
    0.97
     enigmatic
    0.94
     wonderful
    0.93
    Act Density 0.133%

    No Known Activations