INDEX
    Explanations

    self-referential phrases and expressions of sentiment

    New Auto-Interp
    Negative Logits
    MathML
    -0.48
    Supported
    -0.32
    supported
    -0.32
    displaystyle
    -0.30
    spoken
    -0.28
    HomeAsUpEnabled
    -0.28
    Received
    -0.28
     witnessed
    -0.28
     bore
    -0.27
     Supported
    -0.26
    POSITIVE LOGITS
     added
    0.99
     Added
    0.95
    added
    0.90
     modified
    0.88
     removed
    0.87
    Added
    0.86
     selected
    0.85
     ADDED
    0.84
     tweaked
    0.83
    removed
    0.82
    Act Density 0.731%

    No Known Activations