INDEX
Explanations
references to important episodes or conclusions in a series
terms related to television series finales
Negative Logits
tics        -0.81
urg         -0.76
lying       -0.76
artisan     -0.76
uge         -0.75
Cola        -0.74
olds        -0.74
hist        -0.71
absolute    -0.71
guns        -0.71
Positive Logits
finale         1.07
[|             0.88
prematurely    0.81
reel           0.80
spoiler        0.79
airs           0.78
adolesc        0.77
episode        0.76
APR            0.75
adelphia       0.75
Activations Density: 0.012%