INDEX
Explanations
patterns of text that are likely structured as lists or bullet points
occurrences of parentheses or similar symbols in the text
New Auto-Interp
Negative Logits
Tall
-0.76
Ups
-0.73
spir
-0.68
halfway
-0.68
Ros
-0.67
tackling
-0.66
Hung
-0.63
PowerShell
-0.62
Rising
-0.61
Rust
-0.61
POSITIVE LOGITS
iii
1.23
ii
1.21
*)
1.18
...)
1.16
1
1.16
emphasis
1.15
â̦)
1.14
II
1.09
optional
1.05
iv
1.05
Activations Density 0.054%