INDEX
    Explanations

    mentions of New York City

    New Auto-Interp
    Negative Logits
    rar
    -0.92
    ãĥĦ
    -0.84
    ibilities
    -0.81
    terness
    -0.80
    yip
    -0.77
    ãĥ¼ãĥĨ
    -0.76
    iru
    -0.75
    gotten
    -0.74
    VALUE
    -0.73
    wcsstore
    -0.72
    POSITIVE LOGITS
     skyline
    0.97
     subway
    0.95
     streets
    0.95
    scape
    0.95
     borough
    0.94
     landmarks
    0.94
     Council
    0.91
     FC
    0.90
     neighborhoods
    0.88
     Mayor
    0.86
    Act Density 0.022%

    No Known Activations