INDEX
Explanations
references to superhero characters and their associated narratives
New Auto-Interp
Negative Logits
reed
-0.17
oby
-0.15
anca
-0.14
Highlander
-0.14
aeda
-0.14
zee
-0.14
asser
-0.14
omer
-0.14
ebin
-0.14
evin
-0.14
POSITIVE LOGITS
اÙĦعربÙĬ
0.15
Flip
0.14
525
0.14
Spinner
0.14
MetroFramework
0.14
815
0.14
pekt
0.14
Dare
0.14
Flip
0.13
contra
0.13
Activations Density 0.039%