INDEX
Explanations
numbers and foreign words
introductions and explanations
New Auto-Interp
Negative Logits
k
0.41
ES
0.38
IP
0.37
Organisations
0.37
Minerals
0.36
ICOS
0.36
IME
0.36
ANTS
0.36
OPS
0.36
Fluids
0.35
POSITIVE LOGITS
他的
0.42
senang
0.42
自己的
0.41
ሁለት
0.40
второй
0.40
zwei
0.38
ಎರಡು
0.38
ખૂબ
0.38
อย่าง
0.37
hesitated
0.37
Activations Density 6.812%