INDEX
Explanations
words that indicate the introduction or presentation of people and concepts
New Auto-Interp
Negative Logits
264
-0.07
ÑĢÑĸд
-0.07
/im
-0.07
endi
-0.07
vek
-0.07
ocha
-0.06
.getEnd
-0.06
-mf
-0.06
dale
-0.06
ochen
-0.06
POSITIVE LOGITS
iber
0.07
unfamiliar
0.06
concepts
0.06
Orientation
0.06
istar
0.06
kepada
0.06
uffs
0.06
ETA
0.06
bread
0.06
orient
0.06
Activations Density 0.007%