INDEX
Explanations
references to specific TV shows or characters from TV shows
references to the TV show "Arrow" and its related characters
Negative Logits
Token         Logit
uration       -0.87
urers         -0.83
orget         -0.82
employment    -0.80
milo          -0.76
mble          -0.76
xual          -0.76
employed      -0.73
ured          -0.73
enance        -0.71
Positive Logits
Token         Logit
Arrow         1.55
Lantern       0.92
Canary        0.81
Canyon        0.74
Edge          0.74
Creek         0.70
Tool          0.68
Dash          0.67
Protocol      0.67
Dexter        0.66
Activations Density: 0.017%
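
To give a sense of scale for the density figure, here is a minimal sketch that converts a percentage into an expected token count. It assumes that "activations density" means the fraction of sampled tokens on which the feature has a nonzero activation; the page does not state this definition, and the function name `expected_active_tokens` is hypothetical.

```python
def expected_active_tokens(density_percent: float, n_tokens: int) -> float:
    """Expected number of tokens on which the feature fires.

    Assumption (not stated on the page): density is the fraction of
    sampled tokens with a nonzero activation, given as a percentage.
    """
    return density_percent / 100.0 * n_tokens


if __name__ == "__main__":
    # Density value taken from the dashboard; the sample size is illustrative.
    print(expected_active_tokens(0.017, 1_000_000))
```

Under that assumption, a density of 0.017% corresponds to roughly 170 active tokens per million, which fits a feature tied to a narrow topic such as a single TV show.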