INDEX
Explanations
phrases related to authority or commands
instances of personal statements or direct quotes regarding societal expectations and legal matters
New Auto-Interp
Negative Logits
partName
-0.74
argues
-0.68
sqor
-0.62
geoning
-0.61
PCR
-0.61
confines
-0.60
Auditor
-0.57
esters
-0.57
Attempts
-0.57
understandably
-0.56
POSITIVE LOGITS
"#
0.78
undermin
0.75
ĪĴ
0.71
evils
0.65
superiority
0.65
someday
0.62
immoral
0.61
fitt
0.60
imminent
0.60
TAMADRA
0.60
Activations Density 1.447%