INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    arily
    -0.85
    ebook
    -0.79
    lang
    -0.75
    lr
    -0.74
    sonian
    -0.73
    nir
    -0.70
    ahead
    -0.68
    tan
    -0.67
    edly
    -0.66
    rums
    -0.65
    POSITIVE LOGITS
     Hotel
    1.10
     Plaza
    0.83
    ZI
    0.81
     Room
    0.81
     Polo
    0.81
     Rooms
    0.77
     Club
    0.76
    wives
    0.76
     Towers
    0.75
    Hot
    0.75
    Act Density 0.029%

    No Known Activations