INDEX
Explanations
words related to obstacles or impediments
references to barriers in various contexts
New Auto-Interp
Negative Logits
ovy
-0.75
fg
-0.66
ovie
-0.66
intend
-0.66
eared
-0.65
XM
-0.64
Quote
-0.62
ön
-0.62
Stock
-0.61
onna
-0.61
POSITIVE LOGITS
barriers
3.78
barrier
3.67
Barrier
2.64
obstacles
1.64
hurdles
1.64
imped
1.50
hurdle
1.44
obstacle
1.38
walls
1.33
restraints
1.33
Activations Density 0.020%