INDEX
Explanations
names of comic book characters and related terms
references to popular culture, particularly comic book characters and events
New Auto-Interp
Negative Logits
minist
-0.59
reek
-0.57
horm
-0.55
EVENT
-0.53
atform
-0.51
WAY
-0.50
wise
-0.49
MOD
-0.49
glim
-0.49
Reviewer
-0.48
POSITIVE LOGITS
respectively
1.00
_.
0.89
.''.
0.88
fame
0.83
.''
0.82
)).
0.82
*.
0.82
.).
0.78
).
0.76
]."
0.76
Activations Density 1.263%