INDEX
    Explanations

    references to specific dates, particularly those related to historical events

    references to specific dates, particularly in September

    New Auto-Interp
    Negative Logits
     bottleneck
    -0.66
    Reviewer
    -0.63
     segreg
    -0.60
     isolated
    -0.60
    éĹĺ
    -0.59
    theless
    -0.58
     Tart
    -0.57
     material
    -0.56
     Hearts
    -0.56
     idi
    -0.56
    POSITIVE LOGITS
    .,
    1.16
    eme
    1.07
    .;
    0.97
    imus
    0.96
    sburg
    0.95
    .):
    0.94
    .-
    0.94
    ibel
    0.93
    mented
    0.92
    .,"
    0.91
    Act Density 0.018%

    No Known Activations