INDEX
Explanations
phrases indicating varying levels of quality or size
New Auto-Interp
Negative Logits
gorą
-0.66
RegressionTest
-0.57
ALWAYS
-0.57
Zapo
-0.54
Totally
-0.52
typeof
-0.52
Always
-0.51
anyak
-0.51
servez
-0.51
Always
-0.50
POSITIVE LOGITS
decent
0.88
decently
0.81
considerable
0.77
considerable
0.76
fairly
0.75
непло
0.74
sizable
0.74
Decent
0.70
Abraço
0.70
sizeable
0.70
Activations Density 0.255%