INDEX
    Explanations

    references to specific musical works and authorship in compositions

    after commas, quotes, and apostrophes

    titles of books and artworks

    Negative Logits
    WriteBarrier      -0.57
     Longfellow       -0.51
     Ibsen            -0.49
    XDECREF           -0.49
    estros            -0.48
     speciali         -0.46
    Tradition         -0.45
     onOptions        -0.45
     Chaucer          -0.44
    دانشنامهٔ         -0.44

    Positive Logits
    はじめに          0.66
    ratulations       0.65
    atise             0.63
    ècie              0.60
    ائص               0.57
     الأربع           0.56
    ModelAdmin        0.56
    annten            0.56
    pyplot            0.55
    __':              0.55

    Activation Density: 0.135%

    No Known Activations