INDEX
    Explanations

    proper nouns, specifically names of individuals or locations

    New Auto-Interp
    Negative Logits
    âĶĢâĶĢ
    -0.74
     ANGEL
    -0.70
    ktop
    -0.70
     BOX
    -0.69
     VIDE
    -0.66
     daytime
    -0.66
     srfAttach
    -0.62
     volume
    -0.61
     Serie
    -0.61
     PROG
    -0.60
    POSITIVE LOGITS
    baugh
    1.31
    enberg
    1.25
    hoff
    1.17
    hart
    1.14
    ley
    1.12
    love
    1.10
    ingham
    1.09
    gren
    1.08
    berger
    1.07
    meyer
    1.06
    Act Density 0.231%

    No Known Activations