INDEX
Explanations
terms related to measurements of span or distance
arm span to height ratio
New Auto-Interp
Negative Logits
y
-0.43
<bos>
-0.41
מוכ
-0.40
it
-0.39
Ju
-0.38
jij
-0.38
InjectMocks
-0.38
♣
-0.38
ớ
-0.37
ie
-0.36
POSITIVE LOGITS
Span
1.19
span
1.16
Span
1.14
SPAN
1.12
span
1.08
SPAN
1.02
spans
0.93
TextSpan
0.85
spans
0.85
spanning
0.85
Activations Density 0.009%