Explanations
elements related to plot developments and character dynamics in a TV series
Negative Logits
霞            -0.15
_Selection    -0.15
DOG           -0.14
opinion       -0.14
urb           -0.14
odb           -0.14
ьер           -0.14
anan          -0.14
otel          -0.14
isay          -0.14
Positive Logits
aktion        0.17
руг           0.15
pedia         0.15
ustr          0.15
ouz           0.14
Season        0.14
kara          0.14
uds           0.14
introdu       0.14
Wiki          0.14
Activations Density 0.223%