INDEX
    Explanations
    Negative Logits
    assin
    -0.83
     Siegel
    -0.66
    icana
    -0.57
     économies
    -0.56
     réguli
    -0.54
    initiated
    -0.50
     egyszer
    -0.49
     Burnett
    -0.47
     culturelles
    -0.47
    chie
    -0.46
    Positive Logits
    Tembelea
    0.74
    ArgsConstructor
    0.69
    __":\n
    0.67
     LUMP
    0.67
    __":
    0.66
     EconPapers
    0.65
    AsUp
    0.64
    EndInit
    0.63
     становника
    0.59
     يتيمه
    0.59
    Act Density 0.086%

    No Known Activations