INDEX
Explanations
instances where a specified number of items are being referred to
references to groups or sets of items
New Auto-Interp
Negative Logits
Spoiler
-0.72
za
-0.68
Cosponsors
-0.65
ALLY
-0.64
virt
-0.63
Parables
-0.62
only
-0.62
gg
-0.61
imate
-0.61
srfAttach
-0.61
POSITIVE LOGITS
consisted
0.71
resulted
0.68
involves
0.68
involved
0.68
contained
0.65
originated
0.65
corresponds
0.64
referred
0.63
consists
0.63
occurs
0.62
Activations Density 0.075%