INDEX
    Explanations

    companies or organizations

    comparisons involving the word "like."

    New Auto-Interp
    Negative Logits
    Published
    -0.76
    hiba
    -0.72
    inas
    -0.71
    zees
    -0.70
    erial
    -0.69
    ells
    -0.67
    atched
    -0.67
     showc
    -0.67
    ulty
    -0.66
    ipple
    -0.66
    POSITIVE LOGITS
    lihood
    1.55
    lier
    1.10
     minded
    0.97
    liest
    0.95
    minded
    0.92
     ours
    0.90
     yours
    0.79
    liness
    0.78
     wildfire
    0.77
     theirs
    0.73
    Act Density 0.069%

    No Known Activations