INDEX
Explanations
declarations or statements of significance
instances of the word "declare" in various forms
New Auto-Interp
Negative Logits
ographers
-0.76
partName
-0.75
ograp
-0.71
ographer
-0.71
alez
-0.66
ographical
-0.66
RH
-0.64
umbn
-0.64
aptic
-0.63
WARE
-0.63
POSITIVE LOGITS
bankruptcy
1.22
oneself
1.02
allegiance
1.01
war
1.01
victory
1.00
independence
0.99
martial
0.97
himself
0.91
herself
0.91
unequivocally
0.90
Activations Density 0.056%