INDEX
    Explanations

    references to specific names and titles

    New Auto-Interp
    Negative Logits
     Pron
    -0.17
    oup
    -0.15
    anden
    -0.15
    (named
    -0.14
    зÑĥ
    -0.14
    paragraph
    -0.14
    named
    -0.14
    endar
    -0.14
     Named
    -0.14
    olver
    -0.14
    POSITIVE LOGITS
     sob
    0.21
     alternate
    0.21
     alternative
    0.21
     name
    0.20
    sob
    0.19
     generic
    0.18
    alternative
    0.18
    alternate
    0.17
     appell
    0.17
     Alternative
    0.17
    Act Density 0.060%

    No Known Activations