INDEX
    Explanations

    elements of structured data or programming constructs

    New Auto-Interp
    Negative Logits
     text
    -0.21
     format
    -0.20
     name
    -0.19
     date
    -0.19
     document
    -0.18
     display
    -0.18
     content
    -0.18
     dash
    -0.18
     code
    -0.18
     day
    -0.18
    POSITIVE LOGITS
    dependency
    0.52
     dependency
    0.38
    Dependency
    0.37
    dependencies
    0.34
    ependency
    0.34
     Dependency
    0.33
    _dependency
    0.33
    (depend
    0.33
    dependence
    0.32
    -depend
    0.30
    Act Density 0.006%

    No Known Activations