INDEX
Explanations
numbers with units or in calculations
New Auto-Interp
Negative Logits
ordinal
0.53
ASCII
0.51
orchestr
0.49
jad
0.48
heist
0.46
TypeScript
0.46
lifespan
0.46
coexistence
0.45
dutiful
0.45
chore
0.45
POSITIVE LOGITS
8
0.82
7
0.79
6
0.77
5
0.77
2
0.74
9
0.72
1
0.68
3
0.66
4
0.65
0
0.55
Activations Density 0.444%