INDEX
    Explanations

    phrases indicating anticipation or foresight regarding events

    New Auto-Interp
    Negative Logits
    zw
    -0.15
    ane
    -0.14
    iances
    -0.14
    pla
    -0.14
    place
    -0.14
    agan
    -0.14
    CADE
    -0.14
    aus
    -0.14
    ce
    -0.13
    uti
    -0.13
    POSITIVE LOGITS
    usra
    0.17
     actual
    0.15
    NSS
    0.15
     publication
    0.15
    apia
    0.14
    .Skin
    0.14
    ñana
    0.14
    .Restr
    0.14
     cop
    0.14
    ucker
    0.14
    Act Density 0.036%

    No Known Activations