INDEX
Explanations
content related to legal processes and outcomes
Numbers following "to", "and", "by", or "below"
numbers and number phrases
Negative Logits
&           -0.71
            -0.66
<bos>       -0.62
Govt        -0.59
ppl         -0.55
govt        -0.54
<&          -0.53
refugi      -0.50
Assn        -0.49
enamorado   -0.48
Positive Logits
four        1.35
five        1.34
twenty      1.33
six         1.30
seven       1.29
eight       1.28
three       1.28
nine        1.28
twelve      1.28
eighteen    1.27
Activations Density 0.269%