INDEX
    Explanations

    references to actions taken or events that have occurred

    the occurrence of the word "have."

    Negative Logits
        " territ"          -0.63
        "catentry"         -0.63
        "ocol"             -0.60
        " colonization"    -0.60
        " Apart"           -0.59
        "housing"          -0.58
        " fireball"        -0.55
        " settlement"      -0.54
        " blending"        -0.54
        " neigh"           -0.53
    Positive Logits
        " been"            1.22
        "been"             1.04
        " Been"            0.96
        " undergone"       0.93
        "gotten"           0.92
        " gotten"          0.92
        " taken"           0.85
        "ĸļ"               0.85
        " gone"            0.84
        " done"            0.82
    Act Density 0.242%

    No Known Activations