INDEX
    Explanations

    attributes describing objects, particularly focusing on adjectives and their qualitative descriptors

    New Auto-Interp
    Negative Logits
    arrant
    -0.16
    .opensource
    -0.15
    меÑī
    -0.14
    ÄĽÅĻ
    -0.14
    elt
    -0.14
    raci
    -0.14
    ibold
    -0.14
     removeFromSuperview
    -0.14
    ÑĤаб
    -0.14
     halluc
    -0.13
    POSITIVE LOGITS
    rott
    0.17
    oria
    0.17
    olem
    0.16
    åı
    0.15
    inke
    0.15
    alem
    0.14
    edin
    0.14
    oes
    0.14
    amar
    0.14
     Urb
    0.14
    Act Density 0.206%

    No Known Activations