INDEX
    Explanations

    references to academic affiliations and institutions

    New Auto-Interp
    Negative Logits
     Bers
    -0.55
     아이
    -0.53
     kym
    -0.50
    cocc
    -0.49
    2
    -0.48
    -0.47
    livejournal
    -0.47
     übersch
    -0.47
     Recre
    -0.47
     *)__
    -0.47
    POSITIVE LOGITS
    ,",
    0.93
    (',',
    0.84
     :,
    0.83
    (",",
    0.80
    ,',
    0.79
    (@"%@",
    0.78
     Hamlin
    0.78
    omiast
    0.77
    ,:),
    0.77
    ,...,
    0.77
    Act Density 0.417%

    No Known Activations