INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     nut
    -0.10
     cartoon
    -0.10
     Poetry
    -0.09
    æī¶
    -0.09
    ooke
    -0.09
     Cartoon
    -0.09
     slack
    -0.09
     Pam
    -0.09
     Final
    -0.09
     cartoons
    -0.08
    POSITIVE LOGITS
     realistic
    0.17
     novel
    0.16
     novels
    0.15
     realism
    0.15
    nov
    0.12
    ãĥªãĤ¢
    0.11
     roman
    0.11
     Novel
    0.10
     Sinclair
    0.10
     sentimental
    0.10
    Act Density 0.058%

    No Known Activations