INDEX
    Explanations

    numeric values and formatting, likely related to data presentation or citations

    New Auto-Interp
    Negative Logits
    0
    -0.73
    1
    -0.72
    4
    -0.70
    3
    -0.67
    2
    -0.67
    5
    -0.65
    7
    -0.64
    HasAnnotation
    -0.63
    9
    -0.63
    8
    -0.61
    POSITIVE LOGITS
    posedge
    0.82
    ICEF
    0.76
    <![
    0.65
    ſhip
    0.65
    ArrowToggle
    0.62
    BagLayout
    0.60
    المناصب
    0.60
     الرياضيه
    0.59
    openhague
    0.59
    Искәрмәләр
    0.58
    Act Density 0.307%

    No Known Activations