INDEX
Explanations
words related to physical barriers or obstacles
terms related to barriers or constraints
Negative Logits
croft        -0.78
e            -0.72
Accessory    -0.72
Raise        -0.70
Hastings     -0.69
Coffee       -0.66
conn         -0.64
lyss         -0.62
sched        -0.62
hold         -0.61
Positive Logits
etr          1.65
agnetic      1.15
agog         1.04
agonal       1.01
ading        0.99
etrical      0.99
ansky        0.99
kefeller     0.94
obiles       0.93
acers        0.92
Activations Density: 0.008%