INDEX
    Explanations

    phrases indicating liking, wanting, or caring about something

    Non-English words

    New Auto-Interp
    Negative Logits
     complete
    -1.25
    complete
    -1.19
     Complete
    -1.07
    Complete
    -1.04
     open
    -1.02
    open
    -0.91
     COMPLETE
    -0.90
     Open
    -0.87
     run
    -0.84
     secure
    -0.83
    POSITIVE LOGITS
     виправивши
    0.59
     naselje
    0.54
    økt
    0.51
     uintptr
    0.51
     enfans
    0.50
     humains
    0.48
    הערות
    0.48
     normaux
    0.47
     Manusia
    0.47
    AndroidJUnit
    0.47
    Act Density 6.833%

    No Known Activations