Explanations
references to actors and their roles in the film industry
Negative Logits
Token           Logit
abi             -0.16
InputElement    -0.15
iaux            -0.15
ABI             -0.15
elsing          -0.14
à¸ķร            -0.14
ób              -0.14
ÃŃl             -0.14
afort           -0.14
λÏī             -0.13
Positive Logits
Token           Logit
actor           0.86
actors          0.85
acting          0.81
Actor           0.76
actor           0.75
Actors          0.75
Actor           0.69
actors          0.67
Acting          0.66
actress         0.65
Activations Density 0.366%