INDEX
Explanations
instances of apostrophes or apostrophe-related contractions
Negative Logits
Token          Logit
scal           -0.14
aN             -0.14
bos            -0.13
ắn             -0.13
LAB            -0.13
Laboratories   -0.13
avs            -0.13
ocode          -0.13
.Chain         -0.13
311            -0.13
Positive Logits
Token          Logit
undler         0.15
rophy          0.15
other          0.15
лом            0.15
Sokol          0.14
erb            0.14
PCM            0.14
ondon          0.14
imuth          0.14
Tep            0.14
Activations Density: 0.012%