INDEX
    Explanations

    punctuation marks

    Negative Logits
    Token             Logit
    " Ribbon"         -0.52
    "cule"            -0.52
    " Gateway"        -0.50
    " Quad"           -0.50
    "itialized"       -0.49
    "ulum"            -0.48
    " Blueprint"      -0.48
    " blockbuster"    -0.48
    " Coalition"      -0.48
    " acronym"        -0.47
    Positive Logits
    Token             Logit
    " etc"            1.04
    "etc"             0.99
    " lest"           0.95
    " thereby"        0.93
    " whereas"        0.91
    " respectively"   0.88
    "thus"            0.83
    "while"           0.83
    "according"       0.82
    "instead"         0.81
    Activation Density: 0.603%

    No Known Activations