INDEX
    Explanations

    words and phrases related to novelty and newness

    Negative Logits
    canActivate     -0.52
    énario          -0.51
    |}{}            -0.51
    equila          -0.49
                    -0.46
    AtIndexPath     -0.44
    ribune          -0.44
    getValueAt      -0.44
    ymce            -0.44
     hemma          -0.44
    Positive Logits
     new            2.48
    new             2.30
     nueva          2.11
     nuevo          2.11
                    2.06
    新的            2.00
     nuevos         2.00
     nuevas         1.93
     nouvelle       1.92
     NEW            1.92
    Act Density 0.324%

    No Known Activations