INDEX
    Explanations

    proper nouns, especially names likely associated with various individuals

    proper nouns and terms related to particular names or concepts

    New Auto-Interp
    Negative Logits
    cook
    -0.76
    olerance
    -0.75
    OPLE
    -0.74
    phrine
    -0.73
    step
    -0.72
    KEN
    -0.69
     Handler
    -0.69
    eco
    -0.68
     Raider
    -0.68
    watch
    -0.67
    POSITIVE LOGITS
    icles
    1.22
    icle
    1.10
    icular
    0.98
    rha
    0.89
    itational
    0.87
    oras
    0.87
    ora
    0.87
    ilion
    0.85
    rix
    0.85
    ricular
    0.81
    Act Density 0.017%

    No Known Activations