INDEX
    Explanations

    expressions of optimism or future expectations

    New Auto-Interp
    Negative Logits
     alike
    -1.97
    §
    -1.85
    ĥ½
    -1.76
    PI
    -1.70
    erman
    -1.68
    )\].
    -1.62
    ERN
    -1.56
    ser
    -1.54
    atz
    -1.51
    PF
    -1.50
    POSITIVE LOGITS
     DAMAGE
    1.53
     Mn
    1.49
     Britain
    1.42
    yel
    1.38
     Void
    1.35
    balance
    1.34
    ghan
    1.33
     Unicode
    1.31
    bold
    1.29
     result
    1.27
    Act Density 0.014%

    No Known Activations