INDEX
    Explanations

    names of individuals

    proper nouns, specifically names of people

    New Auto-Interp
    Negative Logits
    berra
    -0.56
     âĢº
    -0.55
    uminati
    -0.54
     Archdemon
    -0.53
    Interested
    -0.52
     Paran
    -0.50
     attm
    -0.49
     Flavoring
    -0.49
     Slayer
    -0.49
    taboola
    -0.48
    POSITIVE LOGITS
    kson
    0.63
    espie
    0.62
     recalled
    0.59
     laughed
    0.55
     vetoed
    0.54
    enson
    0.53
    yden
    0.51
     wrote
    0.51
     conceded
    0.51
     detractors
    0.51
    Act Density 0.417%

    No Known Activations