INDEX
    Explanations

    the presence of specific structural markers indicating the start of a document or section

    New Auto-Interp
    Negative Logits
    Portale
    -1.23
     виправивши
    -0.95
    AsUp
    -0.89
    Према
    -0.88
    GraphicsUnit
    -0.86
    脚注の使い方
    -0.85
    }$​
    -0.83
     дописавши
    -0.82
     ]
    
    -0.81
    )}</
    -0.78
    POSITIVE LOGITS
     width
    0.66
    width
    0.55
     Width
    0.53
     шири
    0.52
     WIDTH
    0.50
    Aware
    0.49
    Width
    0.48
     largeur
    0.46
    WIDTH
    0.45
    height
    0.44
    Act Density 0.028%

    No Known Activations