INDEX
    Explanations

    words related to the entertainment industry, specifically Hollywood

    New Auto-Interp
    Negative Logits
    ktop
    -0.95
    İĭ
    -0.86
    arity
    -0.83
    avez
    -0.81
    theless
    -0.79
    cific
    -0.79
    etheless
    -0.78
    nants
    -0.73
    unin
    -0.72
    ombo
    -0.72
    POSITIVE LOGITS
     Reporter
    1.05
     studios
    1.00
     Studios
    1.00
     Hills
    1.00
     Hollywood
    0.93
     Boulevard
    0.89
     Pictures
    0.89
     movies
    0.86
     mog
    0.85
     blockbuster
    0.84
    Act Density 0.029%

    No Known Activations