Explanations
religious terms or references
references to relatives or family relationships
Negative Logits
wide          -0.71
Lead          -0.65
wider         -0.60
thinly        -0.60
Slate         -0.60
slate         -0.59
buck          -0.59
Marketplace   -0.57
Wilde         -0.57
Swan          -0.56
Positive Logits
igion         1.70
iability      1.50
iable         1.47
atively       1.42
igious        1.39
atives        1.36
iever         1.36
iance         1.35
ativity       1.33
aunch         1.33
Activations Density: 0.029%