    Explanations

    texts discussing the directionality of time, focusing on concepts related to moving backwards or forwards in time

    Negative Logits
    Token        Logit
    "ateurs"     -0.82
    "raltar"     -0.81
    "rament"     -0.81
    "oleon"      -0.77
    "abase"      -0.74
    "anooga"     -0.74
    "lez"        -0.74
    "riz"        -0.73
    "rol"        -0.72
    "atum"       -0.72
    Positive Logits
    Token               Logit
    "wards"             0.92
    "stairs"            0.87
    "ward"              0.87
    " compatibility"    0.84
    " compat"           0.78
    " spiral"           0.78
    "WARD"              0.77
    "step"              0.74
    "side"              0.72
    " reflection"       0.66
    Activation Density: 5.090%

    No Known Activations