INDEX
Explanations
instances of punctuation and formatting cues
New Auto-Interp
Negative Logits
realize
-0.22
localized
-0.21
Color
-0.21
Colors
-0.20
accompl
-0.20
realizes
-0.20
Color
-0.19
realized
-0.19
neighbor
-0.19
color
-0.18
POSITIVE LOGITS
whilst
0.34
Bearing
0.28
bearing
0.27
££
0.25
£
0.25
Whilst
0.24
bearing
0.24
NB
0.23
£
0.23
NB
0.23
Activations Density 0.854%