INDEX
Explanations
principles (moral, legal, or fundamental truths)
references to 'principle'
New Auto-Interp
Negative Logits
tagged
-0.70
jer
-0.68
outs
-0.65
acked
-0.65
Crash
-0.64
iro
-0.64
asks
-0.63
irl
-0.63
pour
-0.62
Bust
-0.62
POSITIVE LOGITS
principle
3.73
Principle
2.48
principles
2.20
Principles
1.61
premise
1.50
theory
1.45
theorem
1.41
prin
1.40
concept
1.36
principals
1.31
Activations Density 0.005%