INDEX
Explanations
instances of phrases indicating uniqueness or superlatives
phrases that indicate distinct categories or classifications
New Auto-Interp
Negative Logits
words
-0.62
ļéĨĴ
-0.55
yards
-0.54
carriers
-0.54
cdn
-0.53
Minutes
-0.52
meanwhile
-0.52
ming
-0.51
reading
-0.50
tyard
-0.50
POSITIVE LOGITS
kind
1.45
type
1.11
kinds
1.10
Kind
1.06
sort
1.05
kind
1.02
sorts
1.01
size
0.94
Kind
0.93
type
0.90
Activations Density 0.079%