INDEX
    Explanations

    phrases related to comparison or belonging

    phrases related to majority or significant numbers

    New Auto-Interp
    Negative Logits
     referen
    -0.61
     Seym
    -0.53
     Moroc
    -0.52
     Vaugh
    -0.52
     thous
    -0.50
    chuk
    -0.49
    edIn
    -0.48
    ãĥĩãĤ£
    -0.47
     helicop
    -0.45
     corrid
    -0.45
    POSITIVE LOGITS
     largeDownload
    0.58
    othes
    0.47
    mosp
    0.46
     DRAGON
    0.45
    immer
    0.45
    taboola
    0.45
    ciples
    0.44
    cing
    0.44
    rg
    0.43
    hard
    0.43
    Act Density 3.359%

    No Known Activations