INDEX
    Explanations

    mentions of "these" in various contexts

    New Auto-Interp
    Negative Logits
    dest
    -0.17
     dest
    -0.15
    rous
    -0.15
    kening
    -0.14
    iveau
    -0.14
     αÏħÏĦή
    -0.14
    ners
    -0.14
    Ë
    -0.14
    etry
    -0.13
    rol
    -0.13
    POSITIVE LOGITS
    curity
    0.32
    quence
    0.30
     days
    0.30
     kinds
    0.29
     same
    0.27
    verity
    0.27
    cond
    0.27
     sorts
    0.27
     guys
    0.26
     latter
    0.25
    Act Density 0.121%

    No Known Activations