INDEX
    Explanations

    Japanese names and some other specific personal and geographical names

    proper nouns, specifically names and organizations

    New Auto-Interp
    Negative Logits
    ishes
    -0.60
     undo
    -0.59
    tein
    -0.59
     Frog
    -0.57
     Turing
    -0.56
     spawning
    -0.56
     DPS
    -0.56
    odder
    -0.55
     MIA
    -0.55
     fingerprints
    -0.55
    POSITIVE LOGITS
    acan
    0.84
    æ©
    0.79
    etus
    0.76
    VT
    0.76
    é¾įå
    0.75
    umerable
    0.75
    Lenin
    0.71
    hoe
    0.70
    ACA
    0.70
    idis
    0.69
    Act Density 0.118%

    No Known Activations