INDEX
Explanations
terms related to legal matters or legislation
New Auto-Interp
Negative Logits
иÑĤоÑĢ
-0.15
etry
-0.15
jsc
-0.15
eties
-0.14
apas
-0.14
Addr
-0.14
rak
-0.14
angent
-0.14
rage
-0.14
rod
-0.13
POSITIVE LOGITS
itimate
0.23
islation
0.21
gett
0.19
gings
0.18
ged
0.18
gers
0.18
imized
0.18
isl
0.17
finity
0.16
reta
0.16
Activations Density 0.020%