INDEX
Explanations
defining base classes or models
New Auto-Interp
Negative Logits
gods
0.40
elaboration
0.38
elaborate
0.38
BLY
0.37
nationalists
0.37
prelude
0.37
Gods
0.37
devils
0.36
eils
0.36
dissidents
0.35
POSITIVE LOGITS
Owner
0.56
Founder
0.50
Creator
0.49
Perfect
0.48
Owner
0.47
UserModel
0.46
Model
0.45
Designer
0.45
Constructor
0.45
Founder
0.45
Activations Density 0.016%