INDEX
    Explanations

    phrases indicating agreement to terms and conditions

    phrases related to user agreement and consent in terms of privacy and policies

    New Auto-Interp
    Negative Logits
    ilts
    -0.61
     glim
    -0.60
    oster
    -0.60
    amina
    -0.58
    ngth
    -0.57
    iculture
    -0.57
    mania
    -0.55
    ONES
    -0.54
    gins
    -0.54
    ordinary
    -0.54
    POSITIVE LOGITS
    itivity
    0.68
     {{
    0.65
     thereto
    0.64
    iliate
    0.63
    taboola
    0.61
    emn
    0.59
     Hilbert
    0.59
     to
    0.58
    20439
    0.56
    ettings
    0.56
    Act Density 0.025%

    No Known Activations