INDEX
    Explanations

    words related to reasoning or justification

    words or phrases indicating a sense of reasonableness or sufficiency

    New Auto-Interp
    Negative Logits
     ways
    -0.71
    raltar
    -0.69
    gery
    -0.69
     fixture
    -0.68
    ger
    -0.67
    ials
    -0.67
     methods
    -0.66
    uese
    -0.62
     substitution
    -0.62
    glers
    -0.62
    POSITIVE LOGITS
     situated
    0.91
     priced
    0.82
     spaced
    0.82
     positioned
    0.81
    beit
    0.79
    nerg
    0.78
     entertained
    0.77
    greg
    0.76
     equipped
    0.76
     suited
    0.75
    Act Density 0.033%

    No Known Activations