INDEX
    Explanations

    descriptors related to physical properties and durability of objects

    Negative Logits
    782           -0.15
    个            -0.15
    587           -0.14
     ourselves    -0.14
    649           -0.14
    adele         -0.14
    779           -0.14
    inged         -0.14
    edelta        -0.14
    ικο           -0.13

    Positive Logits
     itself       0.27
     unlike       0.17
    /Foundation   0.16
     readily      0.16
    YST           0.15
    cÃŃ           0.15
    anner         0.15
    _Tis          0.14
    #af           0.14
    abin          0.14

    Act Density 0.216%

    No Known Activations