INDEX
Explanations
occurrences of the word "the" and related phrases indicating breaking or changing patterns or norms
New Auto-Interp
Negative Logits
irth
-0.17
uzzy
-0.14
bsd
-0.14
Ùħخت
-0.14
turb
-0.14
bourg
-0.13
erals
-0.13
eterminate
-0.13
rif
-0.13
RIX
-0.13
POSITIVE LOGITS
barrier
0.31
barriers
0.29
mold
0.28
mould
0.28
Barrier
0.26
deadlock
0.26
seal
0.25
bonds
0.24
Barrier
0.24
spell
0.24
Activations Density 0.027%