INDEX
    Explanations

    named entities, such as names of cities, people, organizations, and dates

    references to specific dates and historical facts

    New Auto-Interp
    Negative Logits
    icter
    -0.78
    ļéĨĴ
    -0.74
     forgetting
    -0.73
     takeaway
    -0.68
     worrying
    -0.68
     tack
    -0.67
    urances
    -0.67
    iment
    -0.66
     WARN
    -0.64
     inexper
    -0.63
    POSITIVE LOGITS
     originated
    0.97
     Originally
    0.90
     refers
    0.88
     Located
    0.87
     aka
    0.82
     coined
    0.82
     Known
    0.81
     Became
    0.79
     represents
    0.75
     Travels
    0.75
    Act Density 0.389%

    No Known Activations