    Explanations

    words related to specific locations or geographical features

    Negative Logits
    Token         Logit
    'abbit'       -1.74
    ' ?"'         -1.66
    ' etc'        -1.62
    ' /**<'       -1.60
    'anni'        -1.60
    'aned'        -1.51
    'anus'        -1.48
    (missing)     -1.47
    'aten'        -1.44
    ' Figure'     -1.43
    Positive Logits
    Token            Logit
    ' submissions'   1.79
    'cco'            1.79
    ' victories'     1.58
    'xim'            1.58
    'uchy'           1.54
    ' win'           1.53
    'bie'            1.52
    'book'           1.46
    'otherapy'       1.43
    ' punches'       1.42
    Activation Density: 0.122%
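
The density figure above can be read as the share of token positions on which the feature fires. The sketch below illustrates that reading only. It assumes density means "fraction of positions with activation above zero", which this page does not confirm. The function name `activation_density`, the `threshold` parameter, and the sample data are all hypothetical.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds threshold.

    Assumption: "density" means the share of positions where the feature
    is active at all. The source page does not define the term.
    """
    if not activations:
        return 0.0
    fired = sum(1 for a in activations if a > threshold)
    return fired / len(activations)


# Hypothetical data: 10,000 token positions, 12 of which activate.
sample = [0.0] * 9988 + [0.7] * 12
print(f"{activation_density(sample):.3%}")  # prints 0.120%
```

A density near 0.1% would mean the feature is active on roughly one token in a thousand. That sparsity fits the "No Known Activations" note, since a small sample of text may contain no firing examples at all.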

    No Known Activations