INDEX
    Explanations

    names of individuals and their relationships or accomplishments

    New Auto-Interp
    Negative Logits
    vier
    -0.15
    ê³
    -0.15
    xaa
    -0.14
    avier
    -0.13
    bout
    -0.13
    اØ
    -0.13
    .Mutable
    -0.13
    blog
    -0.13
    Named
    -0.13
    bao
    -0.13
    POSITIVE LOGITS
     born
    0.23
    188
    0.20
     Born
    0.18
    184
    0.17
    189
    0.17
     papers
    0.16
    192
    0.16
    190
    0.16
    183
    0.16
     Papers
    0.15
    Act Density 0.131%

    No Known Activations