INDEX
Explanations
instances of the word "until."
New Auto-Interp
Head Attr Weights
0:0.02
1:0.02
2:0.05
3:0.07
4:0.16
5:0.03
6:0.12
7:0.06
8:0.06
9:0.03
10:0.07
11:0.25
Negative Logits
coli
-1.86
Chronic
-1.62
"/>
-1.59
tan
-1.56
delinquent
-1.53
}"
-1.52
*.
-1.50
hereafter
-1.50
.>>
-1.49
Sabb
-1.47
POSITIVE LOGITS
livious
2.46
iator
1.92
aughs
1.80
leans
1.80
reet
1.75
teasp
1.71
undai
1.69
affles
1.68
gettable
1.60
uador
1.59
Activations Density 0.001%