INDEX
    Explanations

    phrases that refer to groups or categories, particularly indicated by the word "these."

    New Auto-Interp
    Negative Logits
    dest
    -0.15
    ìłģ
    -0.15
     dest
    -0.15
    ailability
    -0.14
    rol
    -0.14
     αÏħÏĦή
    -0.14
    ation
    -0.14
    Ë
    -0.14
    ener
    -0.13
    iveau
    -0.13
    POSITIVE LOGITS
    curity
    0.31
     same
    0.31
    quence
    0.30
     latter
    0.28
     kinds
    0.26
     sorts
    0.25
    cond
    0.25
    verity
    0.24
     days
    0.23
     guys
    0.23
    Act Density 0.116%

    No Known Activations