INDEX
    Explanations

    instances of the word "Here" in various contexts

    New Auto-Interp
    Negative Logits
     Archdemon
    -0.71
    ONSORED
    -0.70
    natureconservancy
    -0.65
    acci
    -0.61
     Strait
    -0.59
    osc
    -0.58
    Closure
    -0.55
     elim
    -0.54
    efe
    -0.54
     manpower
    -0.53
    POSITIVE LOGITS
    tical
    1.23
    tics
    1.19
    abouts
    1.17
    tic
    1.04
     Comes
    1.02
    itia
    0.84
    after
    0.83
    endix
    0.80
    ford
    0.78
    ibaba
    0.78
    Act Density 0.028%

    No Known Activations