    Explanations

    references to public discourse or public domain information

    Negative Logits
    "adil"           -0.17
    "vise"           -0.15
    "akeup"          -0.15
    "ikip"           -0.15
    " sarcast"       -0.14
    "ilent"          -0.14
    "folk"           -0.14
    "iteDatabase"    -0.14
    "akte"           -0.14
    " Vác"           -0.14
    Positive Logits
    " commission"    0.18
    " Experiment"    0.15
    " Penguin"       0.15
    " exciting"      0.15
    " AUTHORS"       0.15
    " Commission"    0.15
    " book"          0.15
    "Experiment"     0.14
    "iez"            0.14
    "edit"           0.14
    Activation Density: 0.000%

    No Known Activations