INDEX
    Explanations

    mentions of "reading" in various contexts

    New Auto-Interp
    Negative Logits
    odont
    -0.18
    İ
    -0.16
    dge
    -0.15
    ignet
    -0.14
    öl
    -0.14
     Kür
    -0.14
    èĪį
    -0.14
    uso
    -0.14
     Hunger
    -0.14
    ided
    -0.14
    POSITIVE LOGITS
    nable
    0.18
    illas
    0.16
    æģ
    0.16
    ableObject
    0.15
    tatus
    0.15
     tslint
    0.14
    immers
    0.14
    ÑĤÑĮ
    0.14
    ForResult
    0.14
    owler
    0.14
    Act Density 0.018%

    No Known Activations