INDEX
    Explanations

    references to guidance or advice in the context of shopping or home improvement

    New Auto-Interp
    Negative Logits
    okus
    -0.16
    aces
    -0.15
    izik
    -0.15
    Ñģов
    -0.15
    aket
    -0.14
    anzeigen
    -0.14
    antz
    -0.14
     Duc
    -0.14
    oppers
    -0.14
    мена
    -0.13
    POSITIVE LOGITS
    esson
    0.18
    unde
    0.17
    ool
    0.16
    elly
    0.15
    afi
    0.14
    kud
    0.14
    PRS
    0.14
    asje
    0.14
    bubble
    0.14
     bund
    0.14
    Act Density 0.002%

    No Known Activations