    Explanations

    phrases related to writing activities

    instances of the word "write" in various forms and contexts

    Negative Logits
    Token       Logit
    "Ĭ±"        -0.82
    "eger"      -0.82
    "illon"     -0.75
    "alo"       -0.74
    "Unity"     -0.73
    " ILCS"     -0.71
    "aband"     -0.70
    "agara"     -0.69
    "azar"      -0.67
    "EGA"       -0.67
    Positive Logits
    Token        Logit
    " poems"     0.85
    "writer"     0.83
    "Write"      0.80
    "manship"    0.79
    " journal"   0.79
    "smanship"   0.79
    " poem"      0.78
    "writing"    0.77
    "wrote"      0.77
    "writ"       0.77
    Activation Density: 0.033%

    No Known Activations