INDEX
Explanations
structured data where comparable values or categories are being compared or listed
phrases that refer to comparative data or statistics
New Auto-Interp
Negative Logits
ocracy
-0.72
rog
-0.71
gravity
-0.70
ocrats
-0.69
gery
-0.69
ocratic
-0.68
Defenders
-0.66
rak
-0.64
enf
-0.64
yers
-0.63
POSITIVE LOGITS
proport
0.76
sidx
0.76
çͰ
0.74
guiActiveUn
0.72
thereafter
0.71
ãģĨ
0.71
sexes
0.70
situated
0.69
eleph
0.68
é¾įåĸļ士
0.67
Activations Density 0.011%