INDEX
    Explanations

    references to authors and their perspectives on historical arguments

    New Auto-Interp
    Negative Logits
    arkin
    -0.14
    elan
    -0.14
    HQ
    -0.14
    tered
    -0.14
    abinet
    -0.13
    lags
    -0.13
    _NC
    -0.13
    @(
    -0.13
    BY
    -0.13
    ered
    -0.13
    POSITIVE LOGITS
    olars
    0.15
    taÅŁ
    0.15
    á»ķ
    0.15
    Sizer
    0.13
    388
    0.13
    коÑĤ
    0.13
    887
    0.13
    olar
    0.13
    agr
    0.13
    mah
    0.13
    Act Density 0.144%

    No Known Activations