INDEX
    Explanations

    ellipses or indications of omitted text

    Negative Logits
    twimg
    -0.77
     Efq
    -0.77
     Bär
    -0.73
     ‪
    -0.70
     cuir
    -0.70
     dissa
    -0.69
     '\''
    -0.68
    heartedly
    -0.68
     
    -0.68
     Voyez
    -0.67
    Positive Logits
     ...
    1.42
     …
    1.31
     ....
    1.07
     ..."
    1.00
     ...)
    0.96
     restTemplate
    0.95
     ..
    0.94
     ...\n
    0.93
     ·
    0.93
     .....
    0.91
    Act Density 0.128%

    No Known Activations