INDEX
Explanations
titles or specific phrases related to stories, questions, or guidelines
references to stories and questions, particularly in the context of children's literature and related topics
New Auto-Interp
Negative Logits
ously
-0.76
arb
-0.74
exception
-0.70
activity
-0.67
republican
-0.66
rolog
-0.65
forestry
-0.65
livion
-0.64
ris
-0.64
atively
-0.64
POSITIVE LOGITS
Advice
1.06
Parties
1.06
Characters
1.06
Ago
1.04
Shots
1.04
Places
1.02
Against
1.01
Dates
0.99
Without
0.98
Ahead
0.98
Activations Density 0.138%