INDEX
    Explanations

    URL patterns or references in the text

    New Auto-Interp
    Negative Logits
     Efq
    -0.98
     pleaſure
    -0.98
     myſelf
    -0.92
     doubtnut
    -0.90
     itſelf
    -0.89
     raiſ
    -0.87
     Monfieur
    -0.86
     ſeveral
    -0.86
     Conſ
    -0.85
     Haarlem
    -0.83
    POSITIVE LOGITS
     /
    1.55
     /\
    1.04
     Mathis
    0.99
     /=
    0.98
    iwa
    0.94
     Rosenthal
    0.91
     /(\
    0.91
     Jansen
    0.85
     Pfeiffer
    0.85
     -
    0.84
    Act Density 0.096%

    No Known Activations