INDEX
Explanations
references to health, medical conditions, or symptoms
New Auto-Interp
Negative Logits
tvguidetime
-0.71
AssemblyProduct
-0.62
stris
-0.51
még
-0.49
tetap
-0.46
}[
-0.45
dur
-0.45
ainda
-0.44
Przypisy
-0.44
still
-0.44
POSITIVE LOGITS
entire
1.43
entire
1.35
ENTIRE
1.30
Entire
1.24
entirety
1.13
Entire
1.12
whole
0.95
WHOLE
0.90
whole
0.90
gesamten
0.85
Activations Density 0.339%