INDEX
Explanations
phrases related to upward or downward movement or change
terms related to hierarchical structures and actions of raising or lowering standards
New Auto-Interp
Negative Logits
clave
-0.66
iuses
-0.64
izon
-0.63
matter
-0.63
SOURCE
-0.61
gans
-0.60
vier
-0.59
sylv
-0.58
deal
-0.58
zig
-0.57
POSITIVE LOGITS
stakes
1.08
ceilings
1.03
prices
1.02
expectations
1.01
altitude
1.01
levels
0.99
temperatures
0.98
heights
0.97
temperature
0.94
eyebrows
0.90
Activations Density 0.383%