INDEX
Explanations
specific mentions of stories or news articles
frequent references to "story" and related narrative elements
Negative Logits
ignt         -0.89
emale        -0.87
inking       -0.86
ategory      -0.86
ateurs       -0.84
omination    -0.84
amily        -0.83
omin         -0.82
orneys       -0.80
ojure        -0.79
Positive Logits
telling      1.06
arc          0.95
revolving    0.94
arcs         0.85
board        0.84
boards       0.83
book         0.83
tell         0.79
books        0.79
Stories      0.78
Activations Density: 0.044%