INDEX
Explanations
key phrases and indicators related to comparisons or contrasts
punctuation marks and structural elements that separate or conclude content sections.
New Auto-Interp
Negative Logits
-0.36
confirmación
-0.35
verificación
-0.30
ivasan
-0.29
craindre
-0.29
senador
-0.29
kracht
-0.27
emperador
-0.27
interacción
-0.27
constater
-0.27
POSITIVE LOGITS
The
1.88
The
1.45
THE
1.38
THE
1.37
the
0.92
Thé
0.88
ザ
0.88
ザ
0.88
TheGreat
0.88
ذا
0.86
Activations Density 0.142%