INDEX
    Explanations

    references to specific geographical locations or directions

    Negative Logits
    Token           Logit
    "<bos>"         -0.90
    " Sanderson"    -0.57
    " mettent"      -0.53
    " Pyrr"         -0.52
    " Duncan"       -0.52
    " Tierney"      -0.51
    " Kerr"         -0.50
    "hdr"           -0.50
    "Ales"          -0.50
    "Kir"           -0.49
    Positive Logits
    Token           Logit
    " ritard"       0.94
    " paff"         0.94
    " vna"          0.92
    " makro"        0.91
    " marseille"    0.90
    " ohr"          0.90
    " juft"         0.89
    " broder"       0.89
    " mef"          0.89
    " ftre"         0.88
    Activation Density: 0.806%

    No Known Activations