INDEX
    Explanations

    advertisements in the text

    instances of advertisements

    New Auto-Interp
    Negative Logits
    assic
    -0.80
     resultant
    -0.69
     masse
    -0.68
     helicop
    -0.68
     gorilla
    -0.67
     apex
    -0.67
     tant
    -0.66
     destruct
    -0.66
    stood
    -0.65
     embark
    -0.65
    POSITIVE LOGITS
    arty
    0.86
    edIn
    0.81
    edin
    0.78
    rences
    0.78
    yip
    0.77
     ILCS
    0.77
    ences
    0.76
    eatures
    0.75
    Comments
    0.75
    ROR
    0.72
    Act Density 0.016%

    No Known Activations