    Explanations

    mentions of specific names or surnames

    proper nouns, specifically names of people

    Negative Logits
    Token               Logit
    "soDeliveryDate"    -0.60
    "Els"               -0.59
    "ãĥ¼ãĥ³"            -0.55
    " corrid"           -0.55
    " comprom"          -0.54
    " subscript"        -0.54
    " horizont"         -0.54
    " mathemat"         -0.52
    " cyt"              -0.51
    " governors"        -0.50
    Positive Logits
    Token               Logit
    " Jr"               1.04
    " Sr"               0.87
    " III"              0.75
    " aka"              0.69
    "wine"              0.68
    "velt"              0.67
    "gart"              0.66
    "stadt"             0.65
    "sson"              0.65
    "kamp"              0.63
    Activation Density: 0.467%

    No Known Activations