INDEX
    Explanations

    repetitive phrases used for comparison or context in a text

    occurrences of the word "same."

    New Auto-Interp
    Negative Logits
    ;;;;
    -0.72
    *=-
    -0.70
    ËĪ
    -0.69
    ãĤ´ãĥ³
    -0.68
    urally
    -0.68
    export
    -0.67
    icum
    -0.66
     Khe
    -0.65
    rend
    -0.64
    their
    -0.64
    POSITIVE LOGITS
     thing
    0.90
     vein
    0.88
     applies
    0.86
     caveats
    0.84
     principle
    0.84
     kind
    0.80
     principles
    0.79
     reasoning
    0.78
     exact
    0.77
     fate
    0.77
    Act Density 0.042%

    No Known Activations