INDEX
Explanations
references to specific political figures and locations
U.S. state abbreviations after a parenthesis or comma
US states and abbreviations
New Auto-Interp
Negative Logits
Figure
-0.53
okay
-0.52
",
-0.50
”,
-0.49
]",
-0.48
Reverend
-0.47
".
-0.47
”.
-0.47
argli
-0.46
________________
-0.46
POSITIVE LOGITS
Oct
1.22
Fig
1.21
Sept
1.19
Aug
1.17
Fig
1.16
Gov
1.14
Feb
1.11
Nov
1.10
Figs
1.05
FIG
1.04
Activations Density 0.775%