INDEX
Explanations
sentences containing phrases expressing intensity, comparison, or evaluation
predicates and descriptions indicating states of being or existence
New Auto-Interp
Negative Logits
Deaths
-0.72
ainers
-0.71
Inventory
-0.69
Saud
-0.67
Artists
-0.64
Factors
-0.64
Landing
-0.64
Sources
-0.63
Hust
-0.61
Peoples
-0.61
POSITIVE LOGITS
indistinguishable
1.03
neither
1.00
antit
1.00
reminiscent
1.00
supposed
0.98
conducive
0.96
remotely
0.91
mutually
0.90
cedented
0.90
otherwise
0.90
Activations Density 0.205%