INDEX
    Explanations

    appropriate nouns and proper nouns

    phrases that refer to significant people, events, or entities in context

    Negative Logits
    ruction         -0.61
    aughter         -0.57
     Coverage       -0.56
    "],             -0.55
     [];            -0.55
    :               -0.55
    ombat           -0.54
    EStreamFrame    -0.54
    lly             -0.54
    rations         -0.53
    Positive Logits
     aka            0.88
     albeit         0.87
     alas           0.80
     hitherto       0.72
     unlike         0.72
     namely         0.70
     fearing        0.70
     coupled        0.69
    whatever        0.67
     despite        0.67
    Activation Density: 0.295%

    No Known Activations