INDEX
    Explanations

    instances of various writing attributes or authorship

    New Auto-Interp
    Negative Logits
    '];?>
    -0.69
    fromCharCode
    -0.65
    ')}
    -0.60
    ']?>
    -0.60
    ")}
    -0.58
    '),
    
    -0.58
    ')")
    -0.58
    })$}
    -0.58
    ()")
    -0.57
    "}},
    -0.56
    POSITIVE LOGITS
     kasarigan
    0.62
     câte
    0.61
    ThroughAttribute
    0.57
     newOwner
    0.57
     članak
    0.56
    ArgumentParser
    0.56
     erwähnten
    0.56
    curacies
    0.54
     tubeless
    0.54
     дописавши
    0.53
    Act Density 0.202%

    No Known Activations