    Explanations

    specific nouns and phrases indicating actions or states in various contexts

    Negative Logits
    .Manifest         -0.06
    é©                -0.06
    yne               -0.06
     фун              -0.06
    ç¬                -0.06
     UIApplication    -0.06
    .dense            -0.06
    xcb               -0.06
     fenced           -0.06
    entai             -0.06
    Positive Logits
    ąd                0.07
     Hayward          0.07
    legg              0.07
    anus              0.07
    erus              0.06
    ίτ                0.06
    angl              0.06
    czy               0.06
    geois             0.06
    akin              0.06
    Activation Density: 0.004%

    No Known Activations