Explanations
references to legal cases and their outcomes
Negative Logits
Token     Logit
thing     -1.71
THING     -1.25
thing     -1.21
things    -1.20
Thing     -1.19
Thing     -1.13
Things    -1.08
things    -1.07
Things    -1.05
THINGS    -0.97
Positive Logits
Token           Logit
4               0.76
8               0.72
6               0.72
7               0.70
9               0.69
3               0.68
5               0.66
AttributeSet    0.57
2               0.54
endregion       0.49
Activations Density: 1.433%