INDEX
Explanations
references to actors, especially focusing on their performance and involvement in various activities
references to actors in various contexts
Negative Logits
fty            -0.65
PRESS          -0.65
THER           -0.63
coat           -0.62
plet           -0.61
top            -0.61
TOP            -0.60
yss            -0.60
Territories    -0.60
mbol           -0.60
POSITIVE LOGITS
actors         1.00
rities         0.98
actor          0.88
writers        0.88
actresses      0.82
acters         0.77
iov            0.74
actress        0.74
Cosponsors     0.74
chops          0.72
Activations Density 0.010%