INDEX
    Explanations

    instances where the text discusses the implications or consequences of certain actions or decisions

    references to the word "it" often in contexts suggesting discussion of a subject or object

    New Auto-Interp
    Negative Logits
    UGH
    -0.71
    OME
    -0.69
    IFE
    -0.66
     Finish
    -0.64
    Hope
    -0.64
    FUL
    -0.62
    Flight
    -0.61
    Magikarp
    -0.61
     Genius
    -0.61
    hift
    -0.61
    POSITIVE LOGITS
     involves
    1.33
     relates
    1.32
     contradicts
    1.28
     coincides
    1.23
     represents
    1.22
     embodies
    1.18
     violates
    1.18
     contains
    1.17
     resembles
    1.16
     reflects
    1.15
    Act Density 0.216%

    No Known Activations