    Explanations

    the word "believe"

    expressions of disbelief or skepticism

    Negative Logits
    conservancy   -0.79
    ague          -0.76
    pmwiki        -0.73
    accompan      -0.64
    mentioned     -0.63
    cloth         -0.63
    practice      -0.61
    aste          -0.59
     Shed         -0.59
    nec           -0.59
    Positive Logits
    fulness       0.88
    ieve          0.86
    rill          0.81
    ulous         0.79
    ief           0.76
    itious        0.75
    rition        0.74
    enance        0.73
    ieving        0.72
    ership        0.69
    Activation Density: 0.042%

    No Known Activations