INDEX
    Explanations

    references to historical figures and their biographical details

    New Auto-Interp
    Negative Logits
     Folding
    -0.07
    ocket
    -0.06
     Fold
    -0.06
     fold
    -0.06
    abbr
    -0.06
    .enterprise
    -0.06
    hazi
    -0.06
     inter
    -0.06
    cente
    -0.06
    ustin
    -0.06
    POSITIVE LOGITS
     born
    0.20
     Born
    0.17
    Born
    0.16
    born
    0.15
    -born
    0.14
    çĶŁ
    0.13
     birth
    0.12
    çĶŁçļĦ
    0.11
     native
    0.10
     çĶŁ
    0.10
    Act Density 0.046%

    No Known Activations