Explanations
proper nouns or names within a list
phrases that indicate the presence of other entities or items in a list
Negative Logits
govtrack     -0.71
fit          -0.61
glim         -0.61
rison        -0.58
owed         -0.55
Procedure    -0.55
Correct      -0.54
culp         -0.54
oshenko      -0.53
agon         -0.52
Positive Logits
others       1.18
countless    1.13
innumerable  1.02
myriad       1.01
many         0.98
dozens       0.93
numerous     0.87
else         0.83
much         0.82
etc          0.81
Activations Density 0.168%