INDEX
    Explanations

    references to actors

    references to actors and actresses

    New Auto-Interp
    Negative Logits
    aults
    -0.75
    yrs
    -0.70
    tops
    -0.68
    wn
    -0.64
    hops
    -0.63
    ills
    -0.63
    RESULTS
    -0.63
    compl
    -0.62
    ür
    -0.62
    cling
    -0.62
    POSITIVE LOGITS
     actor
    3.84
     actress
    2.83
     Actor
    2.64
     actors
    2.58
    Actor
    2.49
     Actress
    1.94
     filmmaker
    1.81
     comedian
    1.78
    actor
    1.73
     singer
    1.69
    Act Density 0.020%

    No Known Activations