INDEX
Explanations
legal terminologies and phrases indicating orders, actions, and relationships in legal texts
New Auto-Interp
Negative Logits
gatan
-0.71
становника
-0.64
évaluateur
-0.64
defaultstate
-0.61
мәкал
-0.58
définiti
-0.58
nærm
-0.57
exactly
-0.56
Referencie
-0.55
ῖν
-0.55
POSITIVE LOGITS
other
0.72
Other
0.70
other
0.65
Other
0.63
otros
0.62
还有
0.61
還有
0.59
autre
0.57
Еще
0.57
Also
0.57
Activations Density 0.551%