INDEX
Explanations
words related to exits or departures
phrases that indicate movement or transitions out of a place or situation
Negative Logits
avorite     -0.70
tyr         -0.67
arsen       -0.64
laureate    -0.61
etched      -0.60
hedral      -0.59
uzzle       -0.58
Eternity    -0.57
turnover    -0.57
examiner    -0.57
Positive Logits
stretched   1.19
fitted      1.15
casts       1.07
smart       1.03
lander      1.01
doing       0.99
flows       0.98
lier        0.98
doors       0.97
bur         0.97
Activations Density: 0.069%