INDEX
Explanations
words related to formatting and structure in text and the concept of different options or choices
punctuation and formatting characters within the text
New Auto-Interp
Negative Logits
Glac
-0.74
glac
-0.64
1920
-0.62
sixty
-0.62
shotguns
-0.60
hurd
-0.59
960
-0.59
fifty
-0.58
ãĥķãĤ©
-0.58
retard
-0.57
POSITIVE LOGITS
3
1.34
3
1.13
Third
1.04
Third
1.00
III
0.98
iii
0.93
333
0.92
III
0.92
iii
0.90
third
0.90
Activations Density 0.113%