INDEX
Explanations
words related to obedience and disobedience
words and phrases associated with obedience and disobedience
New Auto-Interp
Negative Logits
Fargo
-0.75
NetMessage
-0.75
oan
-0.72
Springfield
-0.71
Winnipeg
-0.70
ewater
-0.69
Ambro
-0.67
Helsinki
-0.66
Paste
-0.66
odor
-0.65
POSITIVE LOGITS
obe
1.14
obedience
1.13
edient
1.12
obedient
1.03
obey
0.95
disob
0.91
edience
0.90
fulness
0.87
paio
0.87
heed
0.85
Activations Density 0.026%