INDEX
    Explanations

    mathematical notations and variables in a formal context

    LaTeX math mode expressions

    New Auto-Interp
    Negative Logits
    )])
    -0.74
    ]),
    -0.71
    $",
    -0.71
    {}".
    -0.69
    ]').
    -0.65
    ]])
    -0.65
    /$',
    -0.64
    %@",
    -0.63
    }<\
    -0.62
    ]))
    -0.62
    POSITIVE LOGITS
     pleaſure
    0.87
     ſmall
    0.87
     myſelf
    0.83
     Reſ
    0.81
     ſeveral
    0.80
    ſelves
    0.80
     PLW
    0.80
     occaf
    0.79
    ſelf
    0.78
     cauſe
    0.78
    Act Density 0.838%

    No Known Activations