INDEX
    Explanations

    instances of quotation marks and dialogue in the text

    New Auto-Interp
    Negative Logits
     propOrder
    -1.00
     queſta
    -0.88
    ロウィン
    -0.84
     ujednoznacz
    -0.82
    bootstrapcdn
    -0.81
    ſelben
    -0.80
     müſſen
    -0.79
     مرئيه
    -0.79
    <unused41>
    -0.78
    <unused68>
    -0.78
    POSITIVE LOGITS
    0.85
     "
    0.71
    "
    0.69
    ("
    0.59
    The
    0.56
     '
    0.56
     “
    0.56
    0.54
    If
    0.53
    0.53
    Act Density 0.002%

    No Known Activations