INDEX
Explanations
punctuation marks, specifically periods
Negative Logits
pus         -0.79
steering    -0.67
advis       -0.65
onboard     -0.62
ribbon      -0.61
drill       -0.59
pim         -0.59
plaque      -0.58
underpin    -0.58
axe         -0.57
Positive Logits
$.          1.10
Va          1.07
S           0.93
Nations     0.82
¢           0.80
STATES      0.78
western     0.77
ĺ           0.76
ª           0.75
Ļ           0.75
Activations Density: 0.039%