INDEX
    Explanations

    significant words and punctuation indicating philosophical or ethical concepts

    Negative Logits
    zen       -0.16
    302       -0.16
    atrice    -0.15
    bah       -0.15
    ensi      -0.15
    aight     -0.14
    omap      -0.14
    bew       -0.14
    leen      -0.14
    tah       -0.14
    Positive Logits
    peg       0.16
     παρά     0.15
     Storm    0.14
    igon      0.14
    rels      0.14
     Strom    0.14
    UPER      0.14
    ayload    0.13
    ich       0.13
    .IsEmpty  0.13
    Activation Density: 0.000%

    No Known Activations