INDEX
    Explanations

    references to figures and tables in a document

    New Auto-Interp
    Negative Logits
    }")
    
    -0.67
    '):
    
    -0.63
    ")));
    
    -0.62
    )";
    
    -0.56
    )");
    
    -0.53
    _
    
    -0.53
    )))
    
    -0.53
    "):
    
    -0.53
    ")){
    
    -0.52
    abcdefghijklmnop
    -0.51
    POSITIVE LOGITS
    sidemargin
    0.69
    OGND
    0.67
    AnchorTagHelper
    0.67
     hunne
    0.64
     zijne
    0.61
     Mère
    0.60
     pareti
    0.60
     nødvendig
    0.59
     Osiris
    0.59
     feroit
    0.58
    Act Density 0.544%

    No Known Activations