INDEX
    Explanations

    words related to fine-tuning or adjustments

    parts of words or suffixes associated with word formation

    New Auto-Interp
    Negative Logits
     hardened
    -0.64
    SPONSORED
    -0.63
    izoph
    -0.62
    otropic
    -0.60
    herry
    -0.60
     representations
    -0.60
     sheer
    -0.60
     embodiments
    -0.59
     thick
    -0.59
     scripture
    -0.59
    POSITIVE LOGITS
    camp
    0.82
     Kemp
    0.75
    atis
    0.71
    ports
    0.69
     Davis
    0.66
     Daniels
    0.63
     Tags
    0.61
    Camp
    0.60
    sburgh
    0.60
    Davis
    0.60
    Act Density 0.320%

    No Known Activations