    Explanations

    academic journal citations

    Negative Logits
    Token             Logit
    " FRIDAY"         -0.89
    "doise"           -0.87
    " Informatics"    -0.86
    "endedor"         -0.86
    " WEDNESDAY"      -0.86
    "局部"            -0.83
    " TUESDAY"        -0.83
    "achim"           -0.83
    ";$"              -0.82
    " reported"       -0.81
    Positive Logits
    Token             Logit
    " winter"          1.02
    " помощи"          1.01
    " spring"          0.97
    "了不少"           0.91
    " Spring"          0.90
    "Spring"           0.87
    "Jama"             0.85
    "Yale"             0.85
    "STOR"             0.85
    "יצד"              0.83
    Activation Density: 0.005%

    No Known Activations