INDEX
    Negative Logits
    Token                Logit
    " GenerationType"    -0.76
    "Getting"            -0.69
    " Getting"           -0.68
    "Adding"             -0.64
    " Obtaining"         -0.63
    "having"             -0.63
    "getting"            -0.62
    " EconPapers"        -0.62
    " GETTING"           -0.62
    " saites"            -0.60
    Positive Logits
    Token                Logit
    " not"               1.54
    "not"                1.05
    " NOT"               0.89
    "Not"                0.77
    " Not"               0.74
    " bukan"             0.68
    "NOT"                0.67
    " mitte"             0.61
    " ikke"              0.60
    "noty"               0.60
    Act Density 0.001%

    No Known Activations