INDEX
Explanations
references to television shows and movies, including titles and production details
specific content about television shows or films, particularly references to their titles or thematic elements
Negative Logits
spiral      -0.76
OTOS        -0.65
scatter     -0.64
Spit        -0.64
juggling    -0.63
rugby       -0.62
Kitty       -0.62
Fam         -0.62
jad         -0.61
scattering  -0.61
Positive Logits
tis         0.85
then        0.83
cause       0.81
drivers     0.80
thus        0.75
which       0.74
shall       0.74
gypt        0.74
lr          0.74
say         0.72
Activations Density: 0.128%