INDEX
Explanations
descriptions or discussions about specific elements within a larger context
references to game design and improvements
New Auto-Interp
Negative Logits
intervened
-0.68
govtrack
-0.65
apo
-0.64
istg
-0.64
Choice
-0.62
akening
-0.62
cession
-0.60
ALWAYS
-0.60
ollar
-0.60
fing
-0.59
POSITIVE LOGITS
similarities
1.17
resemblance
1.05
familiar
1.04
characteristics
1.04
resemb
1.00
wrinkles
0.97
themes
0.93
tropes
0.92
specs
0.92
specifications
0.92
Activations Density 1.039%