INDEX
    Explanations

    references to notable authors and literary figures

    New Auto-Interp
    Negative Logits
    549
    -0.15
    eldorf
    -0.14
    íĽĦ
    -0.14
    onz
    -0.14
     Carly
    -0.14
    iglia
    -0.14
    HL
    -0.14
    747
    -0.14
    æĶ¹éĿ©
    -0.13
    ktion
    -0.13
    POSITIVE LOGITS
     Fantastic
    0.24
     Chamber
    0.23
    Fantastic
    0.22
    umbledore
    0.19
     Sor
    0.18
     Rowling
    0.18
     chamber
    0.18
    hog
    0.18
    DH
    0.18
     Klo
    0.18
    Act Density 0.008%

    No Known Activations