INDEX
    Explanations

    references and mentions of concepts or items in the text

    New Auto-Interp
    Negative Logits
    parker
    -0.69
     guts
    -0.62
     alá
    -0.62
     Kamil
    -0.60
     recevrez
    -0.56
     Wei
    -0.56
     Wal
    -0.56
    STL
    -0.56
     livers
    -0.55
    Ellie
    -0.55
    POSITIVE LOGITS
     Mention
    1.76
     mention
    1.74
     mentions
    1.71
     Mentions
    1.68
     mentioning
    1.65
    Mention
    1.63
    mention
    1.56
     mentioned
    1.55
     Mentioned
    1.46
    mentions
    1.42
    Act Density 0.047%

    No Known Activations