INDEX
    Explanations

    technical terms and notation commonly used in programming or mathematical contexts

    New Auto-Interp
    Negative Logits
    featureID
    -0.92
     Савезне
    -0.90
     Italijanski
    -0.88
     Administrativna
    -0.86
    niſſe
    -0.84
    WithIOException
    -0.83
    Vidite
    -0.77
    -0.76
    <unused14>
    -0.75
    <unused52>
    -0.75
    POSITIVE LOGITS
     Anfitrión
    0.43
     tartalomajánló
    0.28
    +#+
    0.25
     Einwilligung
    0.24
    }}</
    0.24
     kirke
    0.23
     marbre
    0.22
    RenderAtEndOf
    0.22
    setViewportView
    0.22
     }}</
    0.21
    Act Density 19.742%

    No Known Activations