INDEX
Explanations
narratives involving conflict, sacrifice, and powerful abilities
New Auto-Interp
Negative Logits
posedge
-0.56
Claus
-0.55
OfYear
-0.53
vedad
-0.52
textual
-0.50
fficio
-0.50
Project
-0.49
Literatuur
-0.49
SCHEMA
-0.49
tocin
-0.48
POSITIVE LOGITS
__':
0.73
fighting
0.71
fight
0.66
ThroughAttribute
0.65
fight
0.65
FIGHT
0.62
battles
0.62
fought
0.61
fighting
0.61
attacking
0.61
Activations Density 0.266%