INDEX
Explanations
legal terms or phrases
mentions of legislation or related legal concepts
New Auto-Interp
Negative Logits
âķIJâķIJ
-0.85
xual
-0.85
IMAGES
-0.71
ÙIJ
-0.68
ï¸
-0.67
ãģ¦
-0.67
âĸ¬
-0.63
hirt
-0.62
hower
-0.62
ÙĴ
-0.62
POSITIVE LOGITS
itimate
1.32
isl
1.18
acy
1.16
acies
1.15
Leg
0.89
ivers
0.87
uin
0.87
flex
0.87
ogn
0.84
inally
0.83
Activations Density 0.010%