Explanations
references to dragons or dragon-related themes
Negative Logits
SSION           -0.16
OSP             -0.16
anken           -0.14
iliz            -0.14
Doll            -0.14
ellung          -0.14
ModelProperty   -0.14
وز              -0.14
Dice            -0.14
pol             -0.14
Positive Logits
dragons         0.49
dragon          0.48
drag            0.43
Dragons         0.43
Dragon          0.42
Drag            0.41
dragon          0.40
Dragon          0.39
drag            0.36
.drag           0.35
Activations Density 0.011%
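
One way to read the density figure, sketched under an assumption: if "Activations Density" means the fraction of token positions on which the feature fires at all, then 0.011% corresponds to roughly 11 active positions per 100,000 tokens. The function name, the zero threshold, and the toy data below are hypothetical and are not taken from the dashboard.

```python
def activation_density(activations, threshold=0.0):
    """Fraction of token positions whose activation exceeds `threshold`.

    Assumption: "density" is the share of positions with a nonzero
    (above-threshold) feature activation, not the mean activation value.
    """
    if not activations:
        raise ValueError("need at least one activation value")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical toy data: 11 active positions out of 100,000 tokens.
toy_activations = [0.0] * 99_989 + [1.5] * 11
density = activation_density(toy_activations)
print(f"{density:.3%}")
```

On this reading, a density this low indicates a sparse feature that stays silent on nearly all text, which fits a topic as narrow as dragons.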