INDEX
Explanations
phrases that quantify or express a point of view about minimum standards or reliability
NEGATIVE LOGITS
usta
-0.16
zas
-0.15
olls
-0.15
的是
-0.15
rou
-0.15
Lilly
-0.15
isci
-0.15
Zah
-0.14
elize
-0.14
pto
-0.14
POSITIVE LOGITS
至少
0.46
minimum
0.43
atleast
0.40
least
0.35
minimum
0.35
Minimum
0.33
Minimum
0.33
alespoň
0.31
Least
0.30
حداقل
0.30
Activations Density 0.092%