INDEX
Explanations
phrases related to instructions or rules, including directives and prohibitions
punctuation marks and specific structural elements of writing, particularly within lists or sections of text
Negative Logits
GMT       -0.76
worm      -0.67
lock      -0.64
rosis     -0.63
sunset    -0.63
pipe      -0.63
nings     -0.62
rop       -0.62
starter   -0.62
raph      -0.60
Positive Logits
who          1.06
whom         1.05
doms         0.94
Including    0.78
friends      0.77
groups       0.76
Especially   0.76
faiths       0.74
alike        0.72
shows        0.71
Activations Density: 0.710%