INDEX
    Explanations

    expressions of the word "to" indicating purpose or intention

    New Auto-Interp
    Negative Logits
     ſtate
    -1.07
     purpoſe
    -1.00
     auroit
    -1.00
     houſe
    -0.99
     feroit
    -0.94
     myſelf
    -0.94
     pleaſure
    -0.93
     Inscrivez
    -0.93
     enfans
    -0.92
     avoient
    -0.91
    POSITIVE LOGITS
    %");
    0.81
    "):
    
    0.80
     “
    0.79
    "])
    
    0.79
    %")
    0.78
     a
    0.76
    0.74
     an
    0.73
    "]
    
    0.69
    %";
    0.68
    Act Density 0.296%

    No Known Activations