INDEX
    Explanations

    references to films and their adaptations from books

    New Auto-Interp
    Negative Logits
     McKay
    -0.18
    AVE
    -0.17
    submenu
    -0.16
     Narr
    -0.15
    adero
    -0.14
    avec
    -0.14
    ajan
    -0.14
    ictor
    -0.14
     Rosenstein
    -0.14
    ennen
    -0.14
    POSITIVE LOGITS
    reader
    0.16
     Wong
    0.15
    Reader
    0.15
     lok
    0.14
    fall
    0.13
     reader
    0.13
    Sn
    0.13
    LabelText
    0.13
    ArrayOf
    0.13
    cele
    0.13
    Act Density 0.065%

    No Known Activations