INDEX
    Explanations

    references to books and reading experiences

    Negative Logits
    "ptal"       -0.15
    "ideshow"    -0.15
    " rehe"      -0.14
    "ournée"     -0.14
    "chr"        -0.14
    " нар"       -0.14
    "itten"      -0.14
    "isé"        -0.13
    "踏"         -0.13
    "Segue"      -0.13
    Positive Logits
    " reading"    0.35
    " read"       0.30
    " Reading"    0.28
    "reading"     0.28
    "Reading"     0.27
    "读"          0.27
    " 읽"         0.27
    " reads"      0.26
    " book"       0.26
    "reads"       0.26
    Act Density 0.153%

    No Known Activations