INDEX
Explanations
film classification ratings and categories
New Auto-Interp
Negative Logits
pit
-0.15
isse
-0.15
regon
-0.14
pit
-0.14
ARK
-0.14
身份
-0.14
ushima
-0.14
itchen
-0.14
Hä
-0.13
ervers
-0.13
POSITIVE LOGITS
rating
0.42
rated
0.41
Rating
0.39
Rating
0.38
-rated
0.38
PG
0.37
Rated
0.36
rating
0.35
-rating
0.35
ratings
0.34
Activations Density 0.038%