INDEX
Explanations
references to actors
references to actors and actresses
New Auto-Interp
Negative Logits
aults
-0.75
yrs
-0.70
tops
-0.68
wn
-0.64
hops
-0.63
ills
-0.63
RESULTS
-0.63
compl
-0.62
ür
-0.62
cling
-0.62
POSITIVE LOGITS
actor
3.84
actress
2.83
Actor
2.64
actors
2.58
Actor
2.49
Actress
1.94
filmmaker
1.81
comedian
1.78
actor
1.73
singer
1.69
Activations Density 0.020%