INDEX
Explanations
URLs and web links from various domains
website urls
New Auto-Interp
Negative Logits
صوتيه
-0.90
⟬
-0.88
NameInMap
-0.88
transQ
-0.85
kasarigan
-0.85
featureID
-0.85
betweenstory
-0.84
adaptiveStyles
-0.83
indígen
-0.82
ſind
-0.82
POSITIVE LOGITS
0.48
is
0.40
$
0.38
1
0.37
+
0.35
=
0.34
https
0.33
2
0.33
was
0.33
infatti
0.32
Activations Density 0.101%