INDEX
Explanations
phrases indicating significance or magnitude
New Auto-Interp
Negative Logits
encarga
-0.66
Ouch
-0.64
поводу
-0.61
hidupan
-0.58
arraycopy
-0.58
HandleFunc
-0.57
,"
-0.56
ยว
-0.56
nicely
-0.54
urdy
-0.53
POSITIVE LOGITS
greater
1.61
Greater
1.58
Greater
1.55
greater
1.54
GREATER
1.44
maior
1.05
greatest
1.02
greatest
1.00
maior
0.98
Lesser
0.98
Activations Density 0.095%