INDEX
Explanations
concepts or ideas based on principles
mentions of the word "principle" and its variations
New Auto-Interp
Negative Logits
minster
-0.81
reens
-0.69
NetMessage
-0.66
ammers
-0.66
ctic
-0.66
ombs
-0.66
akening
-0.66
eded
-0.65
attery
-0.64
orks
-0.64
POSITIVE LOGITS
ually
1.05
principle
0.99
cipled
0.89
principles
0.85
arily
0.81
ual
0.80
Principle
0.79
SourceFile
0.76
yout
0.75
ciples
0.75
Activations Density 0.014%