INDEX
    Explanations

    punctuation and structural elements in written language

    New Auto-Interp
    Negative Logits
    isoft
    -0.14
    sie
    -0.13
     Deniz
    -0.13
     Yer
    -0.13
    .Constant
    -0.13
    proper
    -0.13
    izzling
    -0.13
     Ø¢ÛĮ
    -0.13
     Bentley
    -0.13
    avis
    -0.13
    POSITIVE LOGITS
    blr
    0.17
    razier
    0.14
    indow
    0.14
    emmel
    0.14
    apel
    0.14
    agal
    0.14
    ibri
    0.14
    à¸Ńà¸Ń
    0.14
    ach
    0.14
    COPY
    0.14
    Act Density 0.004%

    No Known Activations