INDEX
    Explanations

    phrases indicating knowledge or awareness of something

    instances of the word "know" in various contexts

    New Auto-Interp
    Negative Logits
    phrine
    -0.82
    oples
    -0.82
    ItemTracker
    -0.81
    interstitial
    -0.79
    otion
    -0.76
    conservancy
    -0.73
    pite
    -0.71
     Yugoslavia
    -0.71
    atism
    -0.70
    aredevil
    -0.69
    POSITIVE LOGITS
    ledge
    1.14
    ledged
    1.06
    lege
    1.02
    LED
    0.91
     beforehand
    0.77
    ariat
    0.76
     how
    0.76
    edge
    0.76
    hent
    0.72
    how
    0.70
    Act Density 0.064%

    No Known Activations