INDEX
    Explanations

    mentions of the word "Hollywood"

    New Auto-Interp
    Negative Logits
    ktop
    -0.95
    arity
    -0.85
    İĭ
    -0.76
    theless
    -0.76
    cific
    -0.71
    ĸļ
    -0.70
    nants
    -0.69
    onian
    -0.69
    avez
    -0.68
    expr
    -0.68
    POSITIVE LOGITS
     Hills
    1.00
     Reporter
    0.99
     studios
    0.99
     Studios
    0.93
     mog
    0.89
     Boulevard
    0.89
     Hollywood
    0.86
    Film
    0.85
     Pictures
    0.85
     blockbuster
    0.82
    Act Density 0.022%

    No Known Activations