INDEX
    Explanations

    formatting elements and syntactical structures in documents

    New Auto-Interp
    Negative Logits
    .
    -0.50
    Bbb
    -0.49
    -0.46
     in
    -0.45
     must
    -0.45
    huriyet
    -0.44
     plastic
    -0.44
     expressed
    -0.43
    -0.43
     $
    -0.43
    POSITIVE LOGITS
    tagHelperRunner
    0.81
     Monfieur
    0.79
     myſelf
    0.79
     auffi
    0.75
     leaſt
    0.73
     beginnetje
    0.72
    +:+
    0.72
     purpoſe
    0.71
     ſch
    0.71
    fjspx
    0.71
    Act Density 0.954%

    No Known Activations