INDEX
    Explanations

    specific names or entities

    the phrase "the likes of" used in various contexts, often referring to notable people or entities

    Negative Logits
    Token          Logit
    "arding"       -0.79
    "INA"          -0.75
    " Ethics"      -0.73
    "BILITIES"     -0.66
    "INTON"        -0.64
    "士"           -0.63
    "ento"         -0.62
    "verning"      -0.61
    " Springs"     -0.61
    "angan"        -0.61
    Positive Logits
    Token          Logit
    "liest"        1.24
    "lihood"       1.22
    "lier"         1.10
    "liness"       0.87
    "ettings"      0.80
    "bill"         0.73
    "creen"        0.73
    "mith"         0.71
    "hots"         0.71
    "hai"          0.68
    Activation Density: 0.016%

    No Known Activations