INDEX
Explanations
phrases related to abundance and scarcity
New Auto-Interp
Negative Logits
enz
-0.15
vero
-0.15
ottle
-0.14
.mb
-0.14
ocale
-0.14
ouden
-0.14
pios
-0.14
odka
-0.13
ixin
-0.13
RITE
-0.13
POSITIVE LOGITS
nor
0.43
nor
0.33
anymore
0.31
Nor
0.31
Nor
0.30
NOR
0.22
Norris
0.19
بÙĦÚ©Ùĩ
0.19
sondern
0.18
ani
0.17
Activations Density 0.277%