INDEX
    Explanations

    references to novels and literature

    New Auto-Interp
    Negative Logits
     Keim
    -0.73
     equalization
    -0.72
    ^{\
    -0.69
     Eich
    -0.68
    on
    -0.66
     Tweede
    -0.64
     løs
    -0.61
     kork
    -0.60
    ings
    -0.58
     aprend
    -0.58
    POSITIVE LOGITS
     NOVEL
    1.01
     novels
    1.00
     Novel
    0.99
    Novel
    0.99
    theless
    0.97
    novel
    0.95
     Novels
    0.92
     novel
    0.91
    principalTable
    0.86
     weihnachten
    0.85
    Act Density 0.176%

    No Known Activations