INDEX
    Explanations

    occurrences of the word 'but'

    Negative Logits
    selage               -0.97
    natureconservancy    -0.83
    otton                -0.75
    lf                   -0.74
    pload                -0.72
    ãģķ                  -0.72
    gee                  -0.72
    pron                 -0.70
     prope               -0.70
     Galile              -0.70
    Positive Logits
    antes                1.01
    ante                 0.95
    aneously             0.82
    INGTON               0.79
    interstitial         0.77
     Shots               0.76
    onement              0.74
    tered                0.73
    iary                 0.73
    inct                 0.73
    Activation Density: 0.114%

    No Known Activations