INDEX
Explanations
references to epic stories or narratives in literature
New Auto-Interp
Negative Logits
estar
-0.07
tle
-0.07
Kraj
-0.07
alia
-0.06
agen
-0.06
ties
-0.06
tur
-0.06
bridge
-0.06
thing
-0.06
alet
-0.06
POSITIVE LOGITS
entre
0.11
enter
0.10
-length
0.10
ALLY
0.09
proportions
0.08
urious
0.08
ure
0.08
ulously
0.08
RIX
0.07
çĦ¶
0.07
Activations Density 0.005%