INDEX
    Explanations

    verbs related to exploration, investigation, or revelation

    instances of the word "discovered."

    New Auto-Interp
    Negative Logits
    stay
    -0.63
    orting
    -0.60
     tone
    -0.59
    voice
    -0.59
    depending
    -0.59
    regulation
    -0.58
    drive
    -0.58
    clinton
    -0.57
    VOL
    -0.57
     fray
    -0.57
    POSITIVE LOGITS
     discovered
    3.29
     unearthed
    2.13
     uncovered
    2.07
     discovers
    1.95
     discover
    1.93
     found
    1.89
     detected
    1.85
     noticed
    1.79
     discovering
    1.73
     discoveries
    1.67
    Act Density 0.013%

    No Known Activations