INDEX
Explanations
the letters "N" and "L" followed or preceded by other letters
Unusual capitalization/formatting
New Auto-Interp
Negative Logits
N
-1.17
Ν
-0.93
NE
-0.81
ন
-0.77
न
-0.75
Ne
-0.74
Н
-0.74
Nis
-0.73
NA
-0.73
Ни
-0.71
POSITIVE LOGITS
ne
0.84
ni
0.84
na
0.84
n
0.83
no
0.81
nn
0.78
nnnn
0.76
now
0.74
ny
0.74
nine
0.73
Activations Density 0.881%