INDEX
    Explanations

    percentage-related or numerical data in the text

    percentages and degrees

    New Auto-Interp
    Negative Logits
    出版年
    -0.60
    ']
    
    -0.60
    '>
    
    -0.56
    ]
    -0.55
    ']
    -0.55
    ]
    
    -0.55
    "])
    
    -0.54
     }}$
    -0.54
    "]
    -0.53
    "]
    
    -0.53
    POSITIVE LOGITS
     €,
    0.62
     :).
    0.60
     %,
    0.60
    …,
    0.59
     ?,
    0.57
    %,
    0.56
    BibitemShut
    0.56
     (?,
    0.55
     ...,
    0.55
    ":"",
    0.55
    Act Density 0.169%

    No Known Activations