INDEX
    Explanations

    mentions of the word "Form" in various contexts

    New Auto-Interp
    Negative Logits
     railing
    -0.70
    >>\
    -0.68
    EStreamFrame
    -0.63
    visor
    -0.62
    sung
    -0.62
     spree
    -0.59
     Throne
    -0.59
    foreseen
    -0.57
     Leone
    -0.56
     Sons
    -0.56
    POSITIVE LOGITS
    aldehyde
    1.59
    idable
    1.34
    atter
    1.27
    atted
    1.26
    ulating
    1.23
    ulas
    1.16
    ative
    1.15
    ulates
    1.13
    ulations
    1.12
    ula
    1.11
    Act Density 0.026%

    No Known Activations