INDEX
    Explanations

    words related to people, such as verbs like 'are' and 'were'

    the verb "to be" in different forms and contexts

    New Auto-Interp
    Negative Logits
    pedia
    -0.75
    imation
    -0.70
    ricks
    -0.68
     Geh
    -0.63
    ooters
    -0.62
    Footnote
    -0.62
     goodbye
    -0.62
    fail
    -0.62
     cancellation
    -0.61
    wake
    -0.60
    POSITIVE LOGITS
    tein
    0.82
     fluent
    0.73
    dinand
    0.72
     supposed
    0.70
    held
    0.67
     subscribed
    0.67
     behold
    0.67
     married
    0.67
     caught
    0.66
     blinded
    0.66
    Act Density 0.186%

    No Known Activations