INDEX
Explanations
positive descriptions of acting performances
Negative Logits
Token            Logit
ilaire           -0.46
Ptr              -0.44
ην               -0.44
simplifié        -0.42
setBorder        -0.42
ieteur           -0.41
clearInterval    -0.41
歌词             -0.41
節目             -0.41
Sécurité         -0.41
Positive Logits
Token            Logit
actors           1.08
actor            0.99
actress          0.97
Actor            0.92
Actors           0.92
actresses        0.92
Actors           0.89
actors           0.86
acting           0.83
Actor            0.82
Activations Density: 0.198%