INDEX
Explanations
words related to laws, regulations, or official actions
references to legislative or regulatory measures
New Auto-Interp
Negative Logits
osta
-0.78
sites
-0.76
Reserve
-0.72
Forsaken
-0.70
Ruin
-0.68
opath
-0.68
assets
-0.68
oooooooooooooooo
-0.67
si
-0.66
ciating
-0.66
POSITIVE LOGITS
MENTS
0.86
witz
0.82
ment
0.80
arian
0.78
ters
0.76
hof
0.74
ments
0.73
terday
0.73
ter
0.72
xual
0.71
Activations Density 0.020%