INDEX
Explanations
phrases related to legal actions or law enforcement
sentences, particularly those expressing opinions or statements
Negative Logits
tremend     -0.81
therap      -0.78
carbohyd    -0.78
elig        -0.78
volunte     -0.78
skelet      -0.76
warr        -0.71
affili      -0.71
bidder      -0.70
tyr         -0.70
Positive Logits
Thankfully      1.30
Whether         1.30
However         1.27
Luckily         1.27
But             1.25
Fortunately     1.24
Unfortunately   1.22
Sure            1.19
Often           1.18
Usually         1.17
Activations Density 0.511%
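
The density figure above is most naturally read as the fraction of sampled tokens on which the feature fires. The page does not state its exact definition, so the sketch below assumes "fires" means an activation strictly greater than zero. The function name, the threshold parameter, and the sample data are all hypothetical and used only for illustration.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of tokens whose activation exceeds `threshold`.

    Assumption: a feature counts as active on a token when its
    activation is strictly greater than the threshold (0.0 by default).
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: 1000 tokens, 5 of which activate the feature.
sample = [0.0] * 995 + [0.4, 1.2, 0.1, 2.5, 0.7]
density = activation_density(sample)
print(f"{density:.3%}")  # prints 0.500%
```

Under this reading, a density of 0.511% would mean the feature is active on roughly 5 of every 1000 tokens in the sampled text.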