INDEX
    Explanations

    references to publications or documents

    references to published works

    New Auto-Interp
    Negative Logits
    itialized
    -0.83
    llan
    -0.76
    olulu
    -0.73
    uppet
    -0.72
    eric
    -0.69
    heed
    -0.69
    pell
    -0.68
    bones
    -0.68
    othes
    -0.68
    phas
    -0.67
    POSITIVE LOGITS
    lisher
    1.15
    lishing
    0.88
    lishes
    0.84
    DragonMagazine
    0.80
     Publishers
    0.74
    代
    0.73
     Journals
    0.72
     publication
    0.70
     date
    0.70
    é»Ĵ
    0.70
    Act Density 0.012%

    No Known Activations