INDEX
    Explanations

    references to specific years and educational terms

    New Auto-Interp
    Negative Logits
     apost
    -0.15
     Shift
    -0.15
     SWITCH
    -0.14
     audition
    -0.14
    entin
    -0.14
     DÄĽ
    -0.14
    кÑĤа
    -0.13
    bane
    -0.13
     instantiated
    -0.13
    bia
    -0.13
    POSITIVE LOGITS
     season
    0.17
    aille
    0.15
    isÃŃ
    0.15
    RowIndex
    0.14
    acio
    0.14
     Ñģез
    0.14
     nackte
    0.14
    LOAT
    0.14
    éo
    0.14
    ó
    0.14
    Act Density 0.041%

    No Known Activations