    Explanations

    positive adjectives describing things as good, great, or beautiful

    Negative Logits
    Token          Logit
    " indestru"    -1.04
    " 🤣🤣"         -0.99
    " lavorato"    -0.97
    " ricardo"     -0.96
    " alberto"     -0.96
    " ?..."        -0.96
    " sergio"      -0.95
    " jorge"       -0.93
    " scoperto"    -0.92
    "FTFY"         -0.91
    Positive Logits
    Token          Logit
    "<bos>"        0.97
    " great"       0.94
    "great"        0.86
    " Great"       0.85
    "Great"        0.82
    " GREAT"       0.74
    " excellent"   0.64
    "GREAT"        0.63
    " fantastic"   0.63
    " good"        0.61
    Activation Density: 0.137%

    No Known Activations