INDEX
Explanations
terms related to justification and legal reasoning
New Auto-Interp
Negative Logits
+#+#
-0.72
Efq
-0.64
Verſ
-0.60
RegressionTest
-0.59
patate
-0.57
***!
-0.55
Geiſt
-0.55
featureID
-0.55
edicated
-0.52
SURFACE
-0.52
POSITIVE LOGITS
justified
0.81
justify
0.75
justify
0.74
justifies
0.72
justifying
0.71
justification
0.65
justified
0.57
gius
0.56
Jus
0.56
jus
0.56
Activations Density 0.173%