INDEX
Explanations
references to specific episodes
mentions of specific episodes of television shows
New Auto-Interp
Negative Logits
ntil
-0.90
enei
-0.83
sheets
-0.78
erness
-0.75
ple
-0.75
zza
-0.74
punk
-0.74
achev
-0.74
wegian
-0.73
lli
-0.73
POSITIVE LOGITS
episode
0.99
finale
0.89
episodes
0.89
aired
0.87
airing
0.87
airs
0.86
opener
0.82
Transcript
0.80
Episode
0.80
Episode
0.79
Activations Density 0.019%