    Negative Logits
    Token            Logit
    " vip"           -0.08
    " tread"         -0.08
    " Insta"         -0.08
    " Stre"          -0.07
    " Pound"         -0.07
    " vape"          -0.07
    " Hamlet"        -0.07
    " Charter"       -0.07
    " Indi"          -0.07
    " shove"         -0.07

    Positive Logits
    Token            Logit
    " Fernández"      0.08
    " Alexandra"      0.08
    "Sf"              0.08
    " Fernandez"      0.08
    " bos"            0.08
    "ás"              0.08
    "URATION"         0.07
    "urations"        0.07
    " Sf"             0.07
    " rangement"      0.07

    Activation Density: 0.006%

    No Known Activations