INDEX
    Explanations

    film classification ratings and categories

    New Auto-Interp
    Negative Logits
    pit
    -0.15
    isse
    -0.15
    regon
    -0.14
     pit
    -0.14
    ARK
    -0.14
    身份
    -0.14
    ushima
    -0.14
    itchen
    -0.14
     Hä
    -0.13
    ervers
    -0.13
    POSITIVE LOGITS
     rating
    0.42
     rated
    0.41
     Rating
    0.39
    Rating
    0.38
    -rated
    0.38
     PG
    0.37
     Rated
    0.36
    rating
    0.35
    -rating
    0.35
     ratings
    0.34
    Act Density 0.038%

    No Known Activations