INDEX
Explanations
words related to obedience and disobedience
concepts related to obedience and disobedience
New Auto-Interp
Negative Logits
spores
-0.73
Fargo
-0.71
orescent
-0.71
Springfield
-0.70
oan
-0.69
Helsinki
-0.68
reens
-0.67
bourg
-0.67
VAT
-0.66
Brussels
-0.65
POSITIVE LOGITS
edient
1.36
obedience
1.28
edience
1.19
obe
1.18
obedient
1.06
disob
1.01
obey
0.98
disobedience
0.93
ĪĴ
0.87
discipl
0.84
Activations Density 0.019%