INDEX
    Explanations

    quotes or statements reflecting opinions and sentiments

    Quotes or quotation marks

    closing quotation marks

    New Auto-Interp
    Negative Logits
    ).
    
    -0.94
    》.
    -0.89
    /).
    -0.82
     ).
    -0.79
    `).
    -0.78
    %).
    -0.78
    ).
    -0.76
    .";
    
    -0.75
    .");
    
    -0.75
    "];
    
    -0.72
    POSITIVE LOGITS
    ,”
    1.16
    ,"
    1.13
    ”,
    1.00
    ",
    1.00
    ,”
    0.97
    “,
    0.94
    ',"
    0.92
    ,'
    0.91
    ,''
    0.90
    ',
    0.88
    Act Density 0.214%

    No Known Activations