INDEX
Explanations
the beginning of new sections or shifts in context within the text
Text following punctuation or symbols
# followed by word
New Auto-Interp
Negative Logits
a
-0.68
CreateTagHelper
-0.67
posedge
-0.62
falfa
-0.60
-
-0.60
-,
-0.58
S
-0.56
per
-0.55
N
-0.55
/
-0.54
POSITIVE LOGITS
#
1.28
Jefus
0.99
Shakspeare
0.98
hashtag
0.97
ſelves
0.95
##
0.95
ſelf
0.94
Majefty
0.94
hashtags
0.93
/#
0.92
Activations Density 0.051%