    Explanations

    phrases mentioning the concept of modern society or technology

    Negative Logits
    Token     Logit
    Bone      -0.85
    kick      -0.82
    REDACTED  -0.77
    vana      -0.73
    cius      -0.73
    OTH       -0.71
    ANY       -0.70
    ï¸ı       -0.70
    Jar       -0.67
    Keys      -0.66
    Positive Logits
    Token         Logit
    isation       1.16
    ity           1.03
    ization       0.98
    izations      0.96
     incarnation  0.94
    isers         0.93
    izing         0.91
     era          0.88
    ising         0.86
    itized        0.85
    Activation Density: 0.025%

    No Known Activations