INDEX
Explanations
references to specific names or places
New Auto-Interp
Negative Logits
inventoryQuantity
-0.81
Contra
-0.78
defic
-0.77
soType
-0.73
contrace
-0.73
Fairfax
-0.72
dilig
-0.71
confir
-0.70
indecent
-0.70
uyomi
-0.69
POSITIVE LOGITS
k
1.45
K
1.36
ks
1.35
kt
1.31
KC
1.27
ked
1.26
king
1.24
kan
1.23
KS
1.21
KE
1.19
Activations Density 0.131%