INDEX
Explanations
backslashes and other escape characters in text
Negative Logits
Ciro
-0.53
Abad
-0.49
rati
-0.49
est
-0.49
躇
-0.47
regent
-0.47
Lugo
-0.46
Migrant
-0.46
is
-0.46
pert
-0.45
Positive Logits
împre
0.75
\{\\
0.69
..\..\
0.67
<<"\
0.65
("\\
0.64
barbati
0.62
zondere
0.61
"\
0.61
(/\
0.60
damskie
0.60
Activations Density 0.778%