INDEX
Explanations
elements that indicate legal or judicial contexts
New Auto-Interp
Negative Logits
nuanced
-0.98
incentiv
-0.96
Notably
-0.88
underwhelming
-0.85
blurry
-0.82
microbiome
-0.80
impactful
-0.79
overarching
-0.79
backstory
-0.78
prioritize
-0.77
POSITIVE LOGITS
muß
0.90
faßt
0.78
mußte
0.76
mußten
0.76
Moslem
0.75
daß
0.74
skall
0.70
läßt
0.68
wußte
0.68
müßte
0.66
Activations Density 11.970%