INDEX
    Explanations

    sections of text related to events and outcomes

    after "the" exhibiting positive sentiment

    positive adjectives after determiners

    New Auto-Interp
    Negative Logits
    dafx
    -0.61
    OGND
    -0.56
    Jegyzetek
    -0.55
    ItemBackground
    -0.54
    ệc
    -0.54
    Hauptartikel
    -0.52
    rawtypes
    -0.52
    دانشنامهٔ
    -0.52
    )]$
    -0.52
    principalTable
    -0.51
    POSITIVE LOGITS
     wonderful
    2.33
     amazing
    2.08
     lovely
    2.04
    wonderful
    2.00
     beautiful
    1.99
     marvellous
    1.96
     marvelous
    1.96
     magnificent
    1.87
     fabulous
    1.85
     delightful
    1.85
    Act Density 0.252%

    No Known Activations