INDEX
Explanations
phrases expressing a degree of judgment or opinion about various subjects
New Auto-Interp
Negative Logits
Dee
-0.15
бина
-0.14
.kode
-0.14
rab
-0.14
emic
-0.14
akov
-0.14
ift
-0.14
fit
-0.14
ctr
-0.13
authority
-0.13
POSITIVE LOGITS
compens
0.15
ÅĻet
0.15
گاÙĩÛĮ
0.14
.glide
0.14
IDGET
0.14
ома
0.14
æ»
0.14
.Invariant
0.14
tty
0.14
iges
0.13
Activations Density 0.100%