INDEX
    Explanations

    technical specifications and data formatting within documents

    New Auto-Interp
    Negative Logits
     Paz
    -0.17
    abis
    -0.16
    енÑĮ
    -0.16
    AFX
    -0.15
    rez
    -0.15
     Oswald
    -0.15
     kav
    -0.15
     gib
    -0.15
    andbox
    -0.14
    enity
    -0.14
    POSITIVE LOGITS
    153
    0.77
    152
    0.75
    154
    0.74
    150
    0.73
    155
    0.72
    151
    0.72
    156
    0.72
    157
    0.66
    158
    0.63
    159
    0.61
    Act Density 0.085%

    No Known Activations