INDEX
Explanations
mentions of the word "drama."
references to drama in various contexts
New Auto-Interp
Negative Logits
ovo
-0.79
ibel
-0.77
anza
-0.74
erers
-0.73
achev
-0.73
itude
-0.71
ees
-0.70
ossier
-0.69
olphin
-0.68
imates
-0.68
POSITIVE LOGITS
queens
0.96
drama
0.90
dramas
0.87
unfold
0.85
queen
0.81
unfolds
0.80
involving
0.75
comed
0.73
Reign
0.73
unfolded
0.72
Activations Density 0.020%