INDEX
    Explanations

    specific symbols or mathematical notation used in equations

    New Auto-Interp
    Negative Logits
     flo
    -0.52
     fla
    -0.49
    fio
    -0.47
     Word
    -0.45
    </strong>
    -0.44
     dise
    -0.44
     zin
    -0.44
    nextLine
    -0.44
    Tham
    -0.44
     AllAfrica
    -0.43
    POSITIVE LOGITS
     Monfieur
    1.20
     myſelf
    1.10
     quæ
    1.07
     himſelf
    1.06
     purpoſe
    1.05
     chofe
    0.98
     themſelves
    0.95
     feroit
    0.94
     itſelf
    0.94
     ainfi
    0.92
    Act Density 0.005%

    No Known Activations