INDEX
    Explanations

    locations or places mentioned in the text

    the word "in" across various contexts

    New Auto-Interp
    Negative Logits
    Upload
    -0.64
    giving
    -0.60
     critics
    -0.59
    nell
    -0.59
    76561
    -0.57
    enaries
    -0.55
    linger
    -0.54
     chops
    -0.54
     detractors
    -0.54
    ¿½
    -0.54
    POSITIVE LOGITS
     sight
    0.98
     existence
    0.94
    between
    0.93
     society
    0.87
    animate
    0.83
    planet
    0.81
     life
    0.81
     heaven
    0.79
     between
    0.78
     history
    0.77
    Act Density 0.116%

    No Known Activations