INDEX
Explanations
numerical references, specifically ones indicating an upward trend or amount
phrases involving the concept of "upwards" or increasing values
Negative Logits
tein         -0.73
Secrets      -0.71
Doctor       -0.71
Lieberman    -0.71
Mamm         -0.70
Thing        -0.69
Reviewed     -0.69
nen          -0.68
Doctor       -0.68
zig          -0.67
Positive Logits
¥ŀ           1.24
etheless     1.08
srf          0.97
mathemat     0.96
©¶æ¥µ        0.95
proport      0.93
earthqu      0.93
tremend      0.93
upward       0.91
ende         0.91
Activations Density 0.003%
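
As a rough illustration of how a density figure like the one above could be derived, the sketch below assumes that density means the fraction of token positions on which the feature's activation is strictly positive. The source does not state its definition, so this is an assumption, and the function name `activation_density` and the sample data are hypothetical.

```python
from typing import Sequence


def activation_density(activations: Sequence[float], threshold: float = 0.0) -> float:
    """Return the fraction of token positions whose activation exceeds `threshold`.

    Assumption: "density" is the share of tokens on which the feature fires.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical data: 3 active positions out of 100,000 tokens.
sample = [0.0] * 99_997 + [0.8, 1.3, 0.2]
density = activation_density(sample)
print(f"{density:.3%}")  # prints 0.003%
```

Under this reading, a density of 0.003% would correspond to the feature firing on roughly 3 of every 100,000 tokens.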