INDEX
    Explanations

    references to religious texts and philosophical discussions

    Negative Logits
    " Hutchinson"     -0.60
    " proportion"     -0.59
    " Bonner"         -0.52
    " Schulte"        -0.52
    "ANCER"           -0.51
    " McCullough"     -0.50
    " Cotter"         -0.50
    " Farrell"        -0.49
    " Alvarado"       -0.49
    " McGee"          -0.49

    Positive Logits
    " ainfi"           0.56
    " ſtate"           0.55
    "ientras"          0.55
    " purpoſe"         0.53
    " ſche"            0.52
    " éduc"            0.52
    "ambién"           0.51
    "unanje"           0.49
    " Relaciones"      0.49
    "WriteBarrier"     0.47

    Activation Density: 0.010%

    No Known Activations