INDEX
    Explanations

    phrases related to similarity or comparison

    comparisons of similarity between different subjects or concepts

    New Auto-Interp
    Negative Logits
    nos
    -0.64
    unes
    -0.54
     trusted
    -0.53
    onso
    -0.52
    rection
    -0.52
    Published
    -0.51
     Sev
    -0.50
    achment
    -0.50
    capital
    -0.50
     Physicians
    -0.50
    POSITIVE LOGITS
     to
    0.98
     twins
    0.82
     thereto
    0.78
     enough
    0.75
    lihood
    0.74
    ities
    0.74
     unto
    0.74
    ively
    0.70
     except
    0.68
    icut
    0.67
    Act Density 0.048%

    No Known Activations