INDEX
Explanations
phrases related to regulation and legal discussions
New Auto-Interp
Negative Logits
steroids
-0.74
abouts
-0.71
reserves
-0.66
Ferdinand
-0.64
whereabouts
-0.63
displeasure
-0.63
rity
-0.63
admin
-0.62
acre
-0.61
lled
-0.61
POSITIVE LOGITS
³³³³³³³³
1.14
³³³³³³³³³³³³³³³³
1.12
³³³
1.07
³³³³
1.04
"...
0.93
"â̦
0.87
Feature
0.81
³³
0.79
Liter
0.78
Fre
0.77
Activations Density 0.062%