INDEX
Explanations
words related to obedience and disobedience
terms related to obedience and defiance
New Auto-Interp
Negative Logits
iser
-0.82
ikuman
-0.76
iaries
-0.75
iation
-0.75
ewater
-0.74
azines
-0.72
oan
-0.72
ivities
-0.72
ilitating
-0.71
rum
-0.69
POSITIVE LOGITS
obe
1.04
obedience
1.02
edient
1.00
fulness
0.84
disob
0.81
vironment
0.79
obey
0.79
iblical
0.77
guiActive
0.76
obedient
0.76
Activations Density 0.096%