Explanations
specific occurrences of the word "the"
the start of a new text segment or paragraph
Negative Logits
Ãĥ             -0.60
depending      -0.58
fare           -0.58
according      -0.57
[*             -0.56
accordingly    -0.55
—"             -0.54
according      -0.52
.*             -0.51
checks         -0.51
Positive Logits
Beginning      0.66
BOX            0.64
cellar         0.63
foreground     0.62
Philippines    0.62
Conversation   0.61
Forums         0.61
Discussion     0.61
Languages      0.60
nutshell       0.60
Activations Density 0.187%