INDEX
    Explanations

    names of people or locations

    proper nouns, specifically names of individuals and locations

    Negative Logits
    Token           Logit
    "Downloadha"    -0.75
    " Frozen"       -0.68
    "ndra"          -0.67
    "warm"          -0.64
    " favour"       -0.63
    "tics"          -0.62
    "adesh"         -0.62
    " Guilty"       -0.61
    " polar"        -0.59
    "daily"         -0.58
    Positive Logits
    Token           Logit
    "ONSORED"       0.83
    "Department"    0.67
    "ENSE"          0.66
    "itzer"         0.65
    "utenberg"      0.65
    "ourke"         0.63
    "ゼウス"        0.63
    "agall"         0.59
    " Dept"         0.59
    "nel"           0.58
    Act Density 0.309%

    No Known Activations