INDEX
Explanations
the concept of principles or foundational ideas in various contexts
New Auto-Interp
Negative Logits
Ediciones
-0.76
hoebe
-0.71
berea
-0.67
eye
-0.65
Guarda
-0.64
дарь
-0.64
Arag
-0.63
aget
-0.63
tilde
-0.62
flare
-0.62
POSITIVE LOGITS
principles
1.60
Principles
1.57
Principles
1.51
Principle
1.51
principle
1.49
PRINCIPLES
1.47
PRINCIP
1.38
principles
1.37
principle
1.34
Principle
1.34
Activations Density 0.057%