INDEX
    Explanations

    references to the number one or the concept of singularity

    New Auto-Interp
    Negative Logits
    UnusedPrivate
    -0.67
    ly
    -0.67
     Bettina
    -0.66
     ―――――
    -0.66
     raiſ
    -0.65
     ſch
    -0.64
     대해
    -0.62
    apunov
    -0.62
     București
    -0.61
    Cec
    -0.60
    POSITIVE LOGITS
     One
    1.18
     ONE
    1.17
    One
    1.10
     one
    1.05
    one
    0.99
    ONE
    0.99
    updateOne
    0.85
     jeden
    0.84
    WithMany
    0.83
     hundred
    0.82
    Act Density 0.158%

    No Known Activations