INDEX
Explanations
words related to television dramas
terms related to drama in various contexts
New Auto-Interp
Negative Logits
ovo
-0.87
achev
-0.74
ossier
-0.72
ibel
-0.71
olphin
-0.70
erers
-0.70
enhagen
-0.67
ogether
-0.67
itude
-0.67
imus
-0.66
POSITIVE LOGITS
queens
1.02
drama
0.90
queen
0.85
dramas
0.85
unfold
0.83
unfolds
0.80
theater
0.78
airs
0.77
comed
0.75
actor
0.75
Activations Density 0.033%