INDEX
    Explanations

    phrases related to novel concepts or new initiatives

    instances of the word "idea."

    New Auto-Interp
    Negative Logits
     Ago
    -0.74
    ndum
    -0.72
     Peaks
    -0.66
    lake
    -0.66
     ILCS
    -0.65
    eworthy
    -0.63
    gar
    -0.60
    east
    -0.60
     Reporting
    -0.59
    lee
    -0.59
    POSITIVE LOGITS
    ually
    1.08
     idea
    0.82
    atical
    0.81
     moot
    0.81
    yout
    0.78
    @#&
    0.73
    atics
    0.73
    uitive
    0.73
    SourceFile
    0.71
    ual
    0.70
    Act Density 0.030%

    No Known Activations