INDEX
Explanations
references to the Star Wars franchise and its related media
Negative Logits
hir      -0.15
Scene    -0.14
Tits     -0.14
avo      -0.14
nek      -0.14
Scene    -0.14
odium    -0.14
fov      -0.13
odi      -0.13
isay     -0.13
Positive Logits
universe      0.34
canon         0.33
continuity    0.32
univers       0.29
Universe      0.28
Canon         0.28
franchise     0.28
lore          0.27
cannon        0.27
Canon         0.26
Activations Density: 0.125%
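
For context on the density figure, here is a minimal sketch of how such a value might be computed, assuming density means the fraction of token positions where the feature's activation exceeds zero. The page does not state its exact definition, so the function name `activation_density`, the `threshold` parameter, and the sample data are all hypothetical.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of positions whose activation exceeds `threshold`.

    Assumption: "density" here means share of active token positions;
    the source page does not define the term.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical data: the feature fires on 1 of 800 token positions.
sample = [0.0] * 799 + [2.5]
density = activation_density(sample)
print(f"{density:.3%}")
```

Under this reading, a density of 0.125% corresponds to the feature firing on roughly 1 in every 800 tokens.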