INDEX
    Explanations

    locations such as cities and countries

    instances of the word "where," indicating a focus on locations or settings

    Negative Logits
    Token       Logit
    "TAG"       -0.75
    "âĨ"        -0.73
    "Woman"     -0.69
    "leeve"     -0.66
    "hack"      -0.65
    "termin"    -0.64
    "apult"     -0.64
    "fml"       -0.64
    "ME"        -0.63
    "Gaza"      -0.63
    Positive Logits
    Token              Logit
    "upon"             1.56
    "soever"           0.97
    " they"            0.92
    "abouts"           0.85
    " temperatures"    0.82
    " he"              0.81
    " residents"       0.77
    " it"              0.75
    " tensions"        0.74
    " winters"         0.74
    Activation Density: 0.049%

    No Known Activations