INDEX
Explanations
phrases or sentences within brackets
instances of brackets or symbols indicating segmentation or grouping
New Auto-Interp
Negative Logits
cones
-0.71
intellig
-0.71
equivalents
-0.71
seams
-0.70
poisoning
-0.69
stagger
-0.69
values
-0.68
ores
-0.67
piping
-0.67
transported
-0.67
POSITIVE LOGITS
...]
1.55
â̦]
1.54
Pg
1.29
!]
1.22
](
1.07
AUT
1.06
actionDate
1.05
?]
1.04
paragraph
1.03
REDACTED
0.98
Activations Density 0.027%