INDEX
Explanations
phrases associated with literary references and storytelling
New Auto-Interp
Negative Logits
ives
-0.17
å©
-0.15
hist
-0.14
å«
-0.14
abi
-0.14
799
-0.14
636
-0.14
Maher
-0.14
martial
-0.14
commercially
-0.14
POSITIVE LOGITS
Att
0.34
Scout
0.33
Finch
0.31
Mock
0.28
Boo
0.27
Mock
0.26
Att
0.25
mock
0.24
Harper
0.24
Scouts
0.23
Activations Density 0.002%