    Explanations

    dates in the format "Month, Year" with high activation values

    Negative Logits
    Token          Logit
    " heapq"       -0.67
    "Nichts"       -0.63
    " rospy"       -0.61
    "astéro"       -0.60
    " [''],"       -0.59
    "Gambas"       -0.59
    "אין"          -0.59
    "المنا"        -0.59
    "asteroide"    -0.56
    " pymongo"     -0.56
    Positive Logits
    Token          Logit
    " sement"      0.83
    "1"            0.82
    " monaster"    0.77
    " frambo"      0.76
    " kön"         0.76
    " marte"       0.76
    " meras"       0.75
    " vitale"      0.71
    " vermel"      0.71
    " utop"        0.71
    Activation Density: 0.058%

    No Known Activations