INDEX
    Explanations

    punctuation, specifically periods and various forms of regex patterns

    New Auto-Interp
    Negative Logits
    ibble
    -0.17
    rices
    -0.15
    ammers
    -0.14
    AKE
    -0.14
    ronics
    -0.14
    RAFT
    -0.13
     Adjustment
    -0.13
    ATUS
    -0.13
    лÑĮ
    -0.13
    antz
    -0.13
    POSITIVE LOGITS
    spl
    0.16
    635
    0.15
    eniable
    0.15
    ãĥ³ãĤ¬
    0.15
    ãĥ«ãĥī
    0.15
    .synthetic
    0.15
     encount
    0.15
    ovna
    0.15
    fade
    0.14
    Parking
    0.14
    Act Density 0.207%

    No Known Activations