    Explanations

    instances of the word "new"

    Negative Logits
     ویکی‌پدیای  -0.70
    <?>  -0.65
    vège  -0.61
    <>());  -0.58
    Obras  -0.57
     dna  -0.56
    <>("  -0.55
    homonymie  -0.54
    [])  -0.54
    ízo  -0.53
    Positive Logits
     new  1.29
    new  0.97
     नए  0.83
     nuevos  0.83
     neuen  0.78
     novos  0.77
     CreateTagHelper  0.75
     nuovi  0.70
    __(/*!  0.68
     nuevas  0.68
    Act Density 0.098%

    No Known Activations