INDEX
    Explanations

    phrases related to prior knowledge or predictions

    occurrences of the word "knew."

    Negative Logits
    "otion"         -0.77
    "orio"          -0.73
    "phrine"        -0.72
    "adies"         -0.72
    "pmwiki"        -0.70
    "ItemTracker"   -0.70
    "otos"          -0.68
    "adish"         -0.68
    "psey"          -0.67
    "pex"           -0.66
    Positive Logits
    " beforehand"      1.07
    " instinctively"   0.94
    " nothing"         0.76
    "nothing"          0.74
    "lege"             0.73
    "ledged"           0.72
    "footed"           0.70
    "bones"            0.69
    " firsthand"       0.67
    "ledge"            0.67
    Act Density 0.064%

    No Known Activations