INDEX
    Explanations

    references to conflicts and wars

    New Auto-Interp
    Negative Logits
    ê°Ŀ
    -0.15
    issan
    -0.14
    ourmet
    -0.13
    ÌĢ
    -0.13
    oppins
    -0.13
    ละ
    -0.13
    .sap
    -0.13
    crc
    -0.13
    ABCDEFGHI
    -0.13
    .mvp
    -0.13
    POSITIVE LOGITS
     follow
    1.20
    follow
    1.17
     Follow
    1.13
     follows
    1.09
    Follow
    1.08
     followed
    1.06
     FOLLOW
    0.97
    -follow
    0.97
    .follow
    0.93
    _follow
    0.92
    Act Density 0.354%

    No Known Activations