INDEX
    Explanations

    academic references and citations contained within the text

    (Eds.) followed by book titles

    New Auto-Interp
    Negative Logits
    featureID
    -0.65
     
    -0.59
    клопе
    -0.59
    unque
    -0.58
     TAMBÉM
    -0.58
     ligiloj
    -0.57
    UniformLocation
    -0.55
    UMBIA
    -0.55
     löyty
    -0.55
    _));
    -0.54
    POSITIVE LOGITS
     ed
    0.89
     eds
    0.79
    Ed
    0.78
     editor
    0.77
    ed
    0.73
    eds
    0.73
     Ed
    0.72
    Eds
    0.71
     Eds
    0.70
     editors
    0.66
    Act Density 0.071%

    No Known Activations