INDEX
    Explanations

    mentions of writers

    occurrences of the word "writer" in various contexts

    New Auto-Interp
    Negative Logits
    ADRA
    -0.79
    rals
    -0.77
    illon
    -0.76
     Lumpur
    -0.73
    aband
    -0.72
    ĸļ
    -0.71
    ypes
    -0.70
    ibaba
    -0.70
    eneg
    -0.70
    asonic
    -0.68
    POSITIVE LOGITS
     writer
    0.95
    writer
    0.94
     laureate
    0.91
    uscript
    0.87
     fiction
    0.86
    writing
    0.85
    writ
    0.82
    haw
    0.81
     Beware
    0.78
    itatively
    0.76
    Act Density 0.028%

    No Known Activations