INDEX
    Explanations

    HTML-related tags and attributes

    New Auto-Interp
    Negative Logits
     classes
    -0.34
     Classes
    -0.32
    classes
    -0.32
    -class
    -0.32
    classname
    -0.30
    -Class
    -0.30
    .classes
    -0.29
     клаÑģÑģ
    -0.29
    /classes
    -0.28
    klass
    -0.28
    POSITIVE LOGITS
     data
    0.24
     style
    0.23
    style
    0.23
     role
    0.21
     aria
    0.19
    data
    0.19
     Style
    0.19
     STYLE
    0.18
    Style
    0.17
     id
    0.17
    Act Density 0.016%

    No Known Activations