INDEX
    Explanations

    phrases indicating lists or examples

    New Auto-Interp
    Negative Logits
    KommentareTeilen
    -0.64
    OGND
    -0.57
     lainnya
    -0.57
     оригіналу
    -0.57
    地看着
    -0.55
    rdı
    -0.55
    rboles
    -0.55
    contentLoaded
    -0.53
    tymologie
    -0.53
    Geplaatst
    -0.53
    POSITIVE LOGITS
     following
    3.76
    following
    3.29
     Following
    2.78
     FOLLOWING
    2.71
    Following
    2.69
     siguientes
    2.46
     seguenti
    2.39
     seguinte
    2.37
     siguiente
    2.30
     seguintes
    2.23
    Act Density 0.934%

    No Known Activations