INDEX
    Explanations

    occurrences of quotation marks in the text

    Negative Logits
    ajur         -0.75
    aData        -0.69
    fleisch      -0.69
     Fergus      -0.68
    likle        -0.67
    的很         -0.65
     Ortiz       -0.64
     Percival    -0.64
     Gier        -0.62
    Viitteet     -0.62
    Positive Logits
    ",           1.46
    )",          1.43
    ?",          1.35
    '",          1.28
    ]",          1.26
    )".          1.24
    }",          1.23
    ,",          1.21
    $",          1.18
    \"",         1.18
    Act Density 0.103%

    No Known Activations