INDEX
    Explanations

    ordinal numbers

    Negative Logits
    Token (quoted)        Logit
    "<bos>"               -0.74
    "aarrggbb"            -0.73
    "hamcrest"            -0.63
    "kháu"                -0.60
    " wildfires"          -0.55
    "cerol"               -0.54
    " Tafel"              -0.54
    " marié"              -0.54
    " otomatig"           -0.53
    " CreateTagHelper"    -0.52
    Positive Logits
    Token (quoted)        Logit
    " first"              0.81
    "first"               0.79
    " second"             0.64
    " helst"              0.62
    "First"               0.62
    "occasion"            0.61
    "second"              0.61
    " primeira"           0.60
    " time"               0.59
    " временем"           0.59
    Activation Density: 0.062%

    No Known Activations