INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    BOOK
    -0.80
    odic
    -0.80
    DonaldTrump
    -0.75
    ocene
    -0.71
    rencies
    -0.70
     wagon
    -0.68
    alez
    -0.68
    ãĥĥãĥĪ
    -0.68
    */(
    -0.66
    Sov
    -0.66
    POSITIVE LOGITS
     Bruins
    1.06
     Graduate
    0.89
     Extension
    0.86
     UCLA
    0.86
     University
    0.82
    UC
    0.81
     College
    0.81
     alumni
    0.79
     undergrad
    0.79
     Libraries
    0.79
    Act Density 0.005%

    No Known Activations