INDEX
    Explanations

    numbers indicating measurements or figures

    phrases indicative of transformation or change

    Negative Logits
    INTON            -0.64
    wikipedia        -0.60
    Synopsis         -0.60
    models           -0.59
    chapter          -0.57
    FactoryReloaded  -0.56
    ebook            -0.56
    python           -0.56
    oplan            -0.55
    lished           -0.55
    Positive Logits
     scrut           0.51
     Crusher         0.49
     Jagu            0.48
     Row             0.47
     reluct          0.46
     ster            0.46
     Vaugh           0.45
     Kear            0.45
     resil           0.44
     balk            0.44
    Activation Density: 1.709%

    No Known Activations