INDEX
Explanations
questions and references to origins and sources of authority or information
New Auto-Interp
Negative Logits
ambién
-0.41
conmigo
-0.41
llevo
-0.40
registrado
-0.37
싶
-0.37
Jenderal
-0.36
preuves
-0.36
vicepresidente
-0.35
prévue
-0.35
éc
-0.35
POSITIVE LOGITS
source
1.03
sources
0.94
来源
0.91
來源
0.88
source
0.88
nguồn
0.87
Source
0.82
ource
0.82
Sources
0.81
sources
0.80
Activations Density 0.456%