INDEX
    Explanations

    noun phrases indicating existence or identity

    New Auto-Interp
    Negative Logits
    alam
    -0.16
     cad
    -0.14
    chu
    -0.14
    forge
    -0.14
    rees
    -0.13
    office
    -0.13
    .va
    -0.13
     sez
    -0.13
    Anywhere
    -0.13
    illon
    -0.13
    POSITIVE LOGITS
    .slim
    0.19
    akin
    0.16
    hlen
    0.16
    ulos
    0.15
     ÙĦÙĨ
    0.15
    opsis
    0.15
    ütün
    0.14
    .INSTANCE
    0.14
    opot
    0.14
    IMARY
    0.14
    Act Density 0.020%

    No Known Activations