INDEX
Explanations
references to movies, books, and characters from various franchises or series
references to popular television series and their associated characters or themes
New Auto-Interp
Negative Logits
seiz
-0.71
lb
-0.69
reconc
-0.68
HRC
-0.66
Fax
-0.65
disag
-0.62
tenance
-0.62
vulner
-0.62
lawy
-0.61
damages
-0.61
POSITIVE LOGITS
trilogy
1.37
mythology
1.23
novels
1.21
sequels
1.18
franchise
1.17
franchises
1.17
lore
1.10
classics
1.09
series
1.09
universes
1.09
Activations Density 0.409%