INDEX
Explanations
phrases indicating probability or likelihood
New Auto-Interp
Negative Logits
ویکیپدی
-0.50
KURZBESCHREIBUNG
-0.46
///</
-0.43
دانشنامهٔ
-0.42
cuaderno
-0.41
Mixture
-0.40
tened
-0.40
mixture
-0.40
Infórmanos
-0.40
isation
-0.39
POSITIVE LOGITS
няка
0.50
anskje
0.50
שוליים
0.50
OrNil
0.44
ragalactic
0.44
krát
0.44
shtml
0.43
épar
0.43
Chances
0.43
sidemargin
0.43
Activations Density 0.008%