INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     liggen
    -0.89
    }`).
    -0.80
    lodash
    -0.79
     MediatR
    -0.76
    Życiorys
    -0.74
    
    
    -0.74
     auguri
    -0.73
     Infórmanos
    -0.73
    солю
    -0.72
     dieux
    -0.72
    POSITIVE LOGITS
     movies
    1.94
     movie
    1.89
     Movie
    1.59
     Movies
    1.52
     MOVIE
    1.51
    Movie
    1.44
    movie
    1.43
    Movies
    1.43
    movies
    1.41
    MOVIE
    1.16
    Act Density 0.047%

    No Known Activations