INDEX
Explanations
references to dragons or fantastical elements in narratives
New Auto-Interp
Negative Logits
_lane
-0.17
Kansas
-0.16
_LANE
-0.16
OSP
-0.15
osas
-0.14
Kansas
-0.14
etroit
-0.14
afone
-0.14
seksi
-0.14
maduras
-0.14
POSITIVE LOGITS
Tooth
0.39
dragon
0.38
dragons
0.37
Viking
0.36
Vikings
0.35
Dragon
0.34
Dragons
0.34
Norse
0.32
dragon
0.31
Dragon
0.30
Activations Density 0.005%