INDEX
    Explanations

    phrases related to advertisements

    mentions of advertisements

    New Auto-Interp
    Negative Logits
    ĨĴ
    -0.80
    uckle
    -0.80
     Citation
    -0.77
    GoldMagikarp
    -0.69
    ĸļ
    -0.69
    ource
    -0.66
    20439
    -0.66
     Caribbean
    -0.65
    ESSION
    -0.65
     Remem
    -0.65
    POSITIVE LOGITS
    verts
    1.11
     hoc
    1.11
    vertising
    1.11
    elaide
    1.10
    roit
    1.06
     nause
    1.03
    verb
    0.99
    vertis
    0.99
    ieu
    0.98
    idas
    0.97
    Act Density 0.019%

    No Known Activations