INDEX
    Explanations

    references to observational activities, particularly watching and listening

    New Auto-Interp
    Negative Logits
     retours
    -0.54
     profondità
    -0.50
    neri
    -0.50
    PackageManager
    -0.49
     felicità
    -0.49
    ñadir
    -0.48
     tebal
    -0.48
     InputDecoration
    -0.48
     idéia
    -0.48
    нуться
    -0.48
    POSITIVE LOGITS
     watching
    1.66
     watch
    1.62
     watched
    1.60
     Watched
    1.56
    Watching
    1.54
    watching
    1.51
     Watching
    1.49
     watches
    1.46
    Watched
    1.42
     WATCH
    1.42
    Act Density 0.150%

    No Known Activations