INDEX
    Explanations

    names of people

    Negative Logits
    Token         Logit
    "Nar"         -0.57
    " Breaker"    -0.57
    "ãĤ¤"         -0.55
    "ÃįÃį"        -0.55
    "س"           -0.53
    "PDATE"       -0.52
    " Newtown"    -0.52
    "izoph"       -0.52
    " MDMA"       -0.51
    " HUGE"       -0.51
    Positive Logits
    Token             Logit
    "'s"              1.27
    " himself"        1.22
    " testified"      0.96
    " herself"        0.95
    " Productions"    0.93
    " wrote"          0.91
    " remembers"      0.89
    " admits"         0.86
    " presided"       0.85
    " recalls"        0.84
    Activation Density: 0.229%

    No Known Activations