INDEX
Explanations
terms related to rising or ascending situations or states
New Auto-Interp
Negative Logits
ession
-0.17
eding
-0.16
doors
-0.16
arget
-0.15
lined
-0.15
roulette
-0.15
tons
-0.15
Downing
-0.15
ting
-0.15
downs
-0.15
POSITIVE LOGITS
phoenix
0.23
tide
0.22
above
0.21
Phoenix
0.21
-rise
0.20
bud
0.20
borough
0.20
Above
0.18
rise
0.18
Phoenix
0.18
Activations Density 0.031%