INDEX
Explanations
movie directors
instances of the word "dir" and its variations, indicating a focus on film direction and related terminology
New Auto-Interp
Negative Logits
Audit
-0.70
Dragonbound
-0.67
ATES
-0.65
Partnership
-0.62
ATURES
-0.62
ATURE
-0.62
Alert
-0.62
Button
-0.61
Term
-0.61
arella
-0.61
POSITIVE LOGITS
ited
0.95
kish
0.95
kies
0.95
sonian
0.95
kers
0.94
ried
0.92
bing
0.91
kie
0.90
ned
0.89
ffer
0.88
Activations Density 0.073%