INDEX
    Explanations

    quotation marks indicating speech or dialogue

    Negative Logits
     adjud        -0.72
     favour       -0.71
     appro        -0.71
     tro          -0.67
     repro        -0.67
     prelim       -0.66
     lowly        -0.66
     prag         -0.66
     disappro     -0.65
     prec         -0.64
    Positive Logits
    We            1.24
    They          1.15
    Our           1.14
    It            1.13
    There         1.11
    I             1.09
    Everything    1.08
    Because       1.06
    If            1.05
    What          1.05
    Activation Density: 0.133%

    No Known Activations