INDEX
Explanations
phrases related to principles, beliefs, or teachings
references to religious or philosophical doctrines
Negative Logits
Token      Logit
netflix    -0.81
hots       -0.77
RGB        -0.74
umber      -0.69
Bron       -0.67
angan      -0.62
SB         -0.62
ASH        -0.62
Shiny      -0.61
umers      -0.60
Positive Logits
Token      Logit
doctrine   3.69
Doctrine   2.89
doctrines  2.88
dogma      2.02
doctr      1.98
theology   1.71
orthodoxy  1.63
teachings  1.52
theory     1.42
heresy     1.35
Activations Density: 0.016%