INDEX
    Explanations

    phrases emphasizing exclusivity or singularity

    Negative Logits
    ļ               -1.83
    ĸ               -1.73
     angles         -1.64
    Ļ               -1.59
     generations    -1.57
     directions     -1.54
    ¹               -1.49
     steps          -1.48
    ories           -1.48
    rations         -1.46
    Positive Logits
    forge           1.80
    upon            1.61
    jam             1.60
    CTX             1.59
    yon             1.58
    xiv             1.57
     safely         1.56
    GRP             1.56
    quote           1.56
    MTP             1.55
    Activation Density 0.312%

    No Known Activations