INDEX
    Explanations

    references to specific outcomes or evaluations in various contexts

    New Auto-Interp
    Negative Logits
    BeginInit
    -0.57
     المعيارى
    -0.54
    upyter
    -0.53
    க்கு
    -0.53
     natale
    -0.52
    FieldBuilder
    -0.50
    ebaran
    -0.49
    rages
    -0.48
    ristor
    -0.48
    IsTrue
    -0.48
    POSITIVE LOGITS
     autorytatywna
    0.81
    <bos>
    0.72
     disambiguazione
    0.60
    IUrlHelper
    0.60
     ostavi
    0.59
     betweenstory
    0.56
    Erstellt
    0.55
     oprot
    0.55
     Biôgrafia
    0.54
    ItemBackground
    0.53
    Act Density 0.243%

    No Known Activations