INDEX
    Explanations

    punctuation marks and their variations

    Negative Logits
    Logit   Token (quoted to show leading whitespace)
    -0.58   " 싶"
    -0.56   "[]\n"
    -0.55   " المعيارى"
    -0.54   "$")"
    -0.53   " ankles"
    -0.53   "rovna"
    -0.53   "اتی"
    -0.53   "%");"
    -0.52   "textsc"
    -0.51   "zai"
    Positive Logits
    Logit   Token (quoted to show leading whitespace)
    0.94    "verwijspagina"
    0.91    " ujednoznacz"
    0.83    " mxArray"
    0.83    "awtextra"
    0.82    " disambiguazione"
    0.80    "Capítulo"
    0.75    "__':"
    0.75    "because"
    0.74    " /\.("
    0.73    " albeit"
    Activation Density: 0.184%

    No Known Activations