INDEX
    Explanations

    passages indicating that something has been written

    instances of the word "written" in various contexts

    New Auto-Interp
    Negative Logits
    abe
    -0.86
    Ĭ±
    -0.85
    nel
    -0.82
    ugal
    -0.80
     Shinra
    -0.75
    elli
    -0.73
    illon
    -0.73
    allows
    -0.72
    isters
    -0.71
    alin
    -0.70
    POSITIVE LOGITS
     essays
    0.75
     instrument
    0.73
     breath
    0.72
    acters
    0.71
     essay
    0.70
     excerpts
    0.69
     written
    0.69
     typed
    0.68
     aloud
    0.67
    itatively
    0.66
    Act Density 0.025%

    No Known Activations