INDEX
Explanations
punctuation and formatting-related elements in text
New Auto-Interp
Head Attr Weights
0:0.07
1:0.03
2:0.04
3:0.03
4:0.03
5:0.03
6:0.38
7:0.03
8:0.04
9:0.21
10:0.02
11:0.03
Negative Logits
HHS
-4.00
Rubio
-3.86
Malta
-3.73
Katrina
-3.58
flu
-3.57
HERO
-3.52
Puerto
-3.52
Saiyan
-3.47
MTA
-3.41
Snapchat
-3.40
POSITIVE LOGITS
Wood
9.86
Wood
9.52
wood
7.70
Woods
6.81
WOOD
6.63
wood
6.38
Woody
5.85
Wooden
5.84
Woodward
5.78
woods
5.75
Activations Density 0.010%