INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     film
    -2.63
     movie
    -2.30
    film
    -2.27
     films
    -2.23
     Film
    -2.22
    Film
    -2.17
     FILM
    -2.05
    movie
    -2.02
     movies
    -1.95
     Movie
    -1.95
    POSITIVE LOGITS
     Mep
    0.53
     sisält
    0.49
    ensation
    0.48
    expandindo
    0.48
     queima
    0.48
     pirata
    0.47
     authorship
    0.47
     radix
    0.47
     vastaan
    0.47
    Diabetes
    0.46
    Act Density 0.115%

    No Known Activations