INDEX
    Explanations

    references to pictures or images in the text

    Negative Logits
    "oys"               -0.50
    " Cobalt"           -0.50
    " Wulf"             -0.50
    " Graff"            -0.49
    "validators"        -0.48
    " Hawke"            -0.47
    "Auszeichnungen"    -0.47
    " Vegan"            -0.47
    " Shreve"           -0.47
    "hyd"               -0.47
    Positive Logits
    "picture"           1.38
    "Picture"           1.35
    " picture"          1.34
    " Picture"          1.32
    " PICTURE"          1.31
    "PICTURE"           1.23
    "Pictures"          1.16
    " pictures"         1.14
    "pictures"          1.14
    " Pictures"         1.09
    Activation Density: 0.113%

    No Known Activations